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Abstract 

We  propose  a  theory  of  large  movements  in  stock  market  activity.  Our  theory  is  motivated 
by  growing  empirical  evidence  on  the  power-law  tailed  nature  of  distributions  that  characterize 
large  movements  of  distinct  variables  describing  stock  market  activity  such  as  returns,  volumes, 
number  of  trades,  and  order  flow.  Remarkably,  the  exponents  that  characterize  these  power 
laws  are  similar  for  different  countries,  for  different  types  and  sizes  of  markets,  and  for  different 
market  trends,  suggesting  that  a  generic  theoretical  basis  may  underUe  these  regularities.  Our 
theory  provides  a  unified  way  to  understand  the  power-law  tailed  distributions  of  these  variables, 
their  apparently  universal  nature,  and  the  precise  values  of  exponents.  It  links  large  movements 
in  market  activity  to  the  power-law  distribution  of  the  size  of  large  financial  institutions.  The 
trades  made  by  large  financial  institutions  create  large  fluctuations  in  volume  and  returns.  We 
show  that  optimal  trading  by  such  large  institutions  generate  power-law  tailed  distributions  for 
market  variables  with  exponents  that  agree  with  those  found  in  empirical  data.  Furthermore, 
our  model  also  makes  a  large  number  of  testable  out-of-sample  predictions. 
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1     Introduction 


Even  ex  post,  stock  market  fluctuations  are  very  hard  to  explain  by  movement  in  fundamentals. 
Trading  per  se  seems  to  move  prices.  This  leads  to  excess  volatility  of  stock  market  prices.  Even 
crashes  seem  to  happen  for  no  good  reason.  Trading  volume  is  very  high,  and  its  large  variations 
are  also  hard  to  relate  to  fundamentals.  The  present  paper  wishes  to  present  a  theory  of  those  large 
movements  in  trading  activity^ . 

For  this  is  useful  to  have  precise  empirical  facts.  Otherwise,  theories  are  too  unconstrained.  This 
is  why  we  use  a  series  of  sharp  empirical  facts,  established  using  dozens  of  millions  of  data  points. 
They  are  quite  precise,  hence  they  constrain  any  theory  of  large  movements  of  stock  market  activity. 
We  first  outline  some  of  those  key  empirical  regularities  before  presenting  our  theory. 

1.1     The  power-law  distribution  of  returns,  volume,  number  of  trades 

Our  theory  is  motivated  by  the  following  empirical  findings  on  the  power  law  distribution  of  (i) 
returns,  (ii)  volumes,  and  (iii)  the  number  of  trades,  (iv)  the  power  law  of  price  impact,  and  (v)  the 
power  law  distribution  of  the  size  of  large  investors. 

(i)  The  power  law  distribution  of  returns.  Let  Pt  denote  the  price  of  a  stock  or  an  index,  and 
define  return  over  a  time  interval  At  as  r^  =  \n(Pt/Pt-At)-  Empirical  studies  (Gopikrishnan  et  al. 
(1999),  Plerou  et  al.  (1999))  show  that  the  distribution  function  of  returns  for  1000  largest  U.S. 
stocks  and  several  major  international  indices  decays  as 

P  (|r|  >  x)  ~  ^with  Cr  ^  3  (1) 

Here,  ~  means  asymptotic  equality  up  to  numerical  constants^ .  This  holds  for  positive  and  negative 
returns  separately. 

Figure  1  shows  that  Pd?"!  >  a;)  on  a  bi-logarithmic  scale  displays  a  linear  asymptotic  behavior 
with  slope  —(r  =  —3.1.  Since  linear  behavior  on  a  log-log  plot  means  a  power-law  decay,  we  conclude 
that  lnP(|r|  >  a;)  =  —3.1  In  a;  -I-  constant,  i.e.  (1).  There  is  no  tautology  that  implies  that  this 
graph  should  be  a  straight  line,  or  that  the  slope  should  be  -3.  A  Gaussian  would  have  a  concave 
parabola,  not  a  straight  line.  The  remarkable  nature  of  the  distribution  (1)  is  that  it  holds  for  stock 
indices  and  individual  stocks  different  sizes,  different  time  periods  (see  Section  2  for  a  systematic 
exploration),  and  we  always  find  Cr  —  3.  In  the  following,  we  shall  refer  to  Eq.  (1)  as  "i/ie  cubic  law 
of  returns" . 

(ii)  The  power  law  distribution  of  trading  volume.  We  call  volume  the  number  of  shares  traded^ . 
We  find  that  the  distribution  is: 

P(g>x)~^  with  C,=i  3/2.  (2) 


a; '■I 


The  finding  holds  both  for  the  volume  of  individual  trades  and  aggregate  volumes: 

P(g>a:)~-^withCQ:^3/2.  (3) 


A  condensed  version  of  some  elements  of  the  present  paper  appeared  in  Gabaix  et  al.  (2003a). 
"  Formally  /(x)  ~  g  (x)  means  /  (x)  /g  (x)  tends  to  a  positive  constant  (not  necessarily  1)  as  x  — »  oo. 
The  dollar  value  traded  yields  very  similar  results.    This  is  expected,  as,  for  a  given  security,  the  variations  of 
number  of  shares  traded  and  the  volume  traded  are  essentially  proportional. 
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Figure  1:  Empirical  cumulative  distribution  of  the  absolute  values  of  the  normalized  15  minute 
returns  of  the  1000  largest  companies  in  the  TAQ  database  for  the  2-yr  period  1994-1995.  This 
represents  12  million  observations.  In  the  region  2  <  x  <  80  we  find  an  OLS  fit  lnP{r  >  x)  = 
—(^r  In  x+b,  with  (^r  =  3.1±0.1.  This  means  that  returns  are  distributed  with  a  power  law  P  (r  >  x)  ~ 
x~^''  for  large  x. 


We  had  initially  found  this  in  U.S.  data  (Gopikrishnan  et  al.  2000),  and  we  have  recently 
confirmed  the  same  on  French  data  (Plerou  et  al.  2003).  Figure  2  illustrates  this.  It  shows  the 
density  satisfies  p  {q)  ~  g~^'^,  i.e.  (2).  The  exponent  of  the  distribution  of  individual  trades  is  close 
to  1.5.  In  the  following,  we  refer  to  Eq.    (2)-(3)  as  the  "half-cubic  law  of  trading  volume". 

As  before,  the  exponents  describing  these  power-laws  seem  to  be  stable  across  different  types  of 
stocks,  different  time  periods  and  time  horizons  etc.  (see  Section  2). 

(iii)  The  power  law  distribution  of  the  number  of  trades.   A  similar  analysis  for  the  number  of 

trades  shows  that 

1 


P{N>x) 


r.CN 


with  Ca 


3.3 


(4) 


(iii)  The  power  law  of  price  impact. 

Building  on  the  concave  nature  of  the  impact  functions  in  stock  markets  (Hasbrouck  1991,  Has- 
brouck  and  Seppi  2001,  Plerou  et  al.  2002),  we  will  give  evidence  that  the  price  impact  Ap  of  a 
trade  of  size  V  scales  as: 

Ap  ~  V^  with  7  ~  1/2.  (5) 

(v)  Power  law  distribution  of  the  assets  of  large  investors.  Our  empirical  analysis  of  the  size 
distribution  of  mutual  funds  shows  evidence  for  a  power  law  distribution.  As  Section  2.6  reports  the 
number  of  funds  with  size  (asset  under  management)  greater  than  x  follows: 


P{S>x) 


cCs 


with  Cs  —  1 


(6) 


We  will  present  a  theory  where  those  five  facts  are  intimately  related.  It  is  largely  orthogonal  to 
the  existing  finance  theory.  Indeed,  much  of  finance  is  about  the  risk  premia  commanded  by  various 
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Figure  2:  Probability  density  of  the  individual  trade  sizes  q  for  30  large  stocks  in  the  Paris  Bourse 
from  January  1995  to  October  1999.  This  represents  35  million  observations.  OLS  fit  yields  a  density 
~  g-Ci+C,),  for  Q  =  1.5  ±  0.1.  Source:  Plerou  et  al.  (2003). 


shocks,  but  not  about  the  origins  of  the  shocks  themselves.  For  instance,  if  rMt  is  a  market  return, 
and  £it  is  an  idiosyncratic  return,  a  typical  finance  model  explains  why  an  asset  i  will  have  a  return 
of  the  factor  type: 

Tit  =  PirMt  +  £it 

what  causes  variations  in  the  loadings  /3j,  or  even  what  the  relevant  factors  tm  should  be.  In  contrEist, 
our  goal  here  is  not  to  explain  risk  premia,  but  rather  to  propose  a  structured  theory  for  the  origin 
of  the  shocks  rMt,  £it-  For  this  we  are  guided  by  the  regularities  in  their  distribution  and  in  the  way 
they  relate  to  other  market  quantities  such  as  volume  and  volume  "imbalance" .  In  other  words,  we 
wish  to  theorize  on  the  origins  of  those  movements,  especially  to  the  extent  that  they  cannot  easily 
be  related  to  fundamentals. 

1.2     Outline  of  the  theory 

In  our  theory,  large  volumes  and  large  returns  are  created  by  the  trades  of  large  investors.  To 
see  this  qualitatively,  suppose  that  a  large  investor  decides  to  trade  in  a  stock  due  to  some  news 
announcement,  or  some  recent  analysis  he  made  himself.  For  a  sufficiently  large  investor,  the  desired 
quantity  of  the  particular  stock  will  be  comparable  to  the  daily  turnover'' .  So  if  he  executes  the 
trade  quickly  he  will  suffer  a  large  price  impact.  However,  the  alternative  of  performing  the  trade 
very  slowly  over  time  is  not  attractive  either  since  the  mispricing  cannot  be  expected  to  remain 
indefinitely,  so  he  does  not  wish  to  wait  too  long  before  realizing  the  trade.  We  show  that  under 
plausible  conditions,  his  "optimal"  trading  behavior  generates  power  law  exponents  for  returns, 


'Please  see  Section  4.6.2  for  empirical  support  of  this  assertion. 


volumes  and  number  of  trades,  i.e.  the  exponents  Cq  =  d?  =  3/2  for  the  volume,  and  Cr  =  Cw  =  3 
for  the  return  and  number  of  trades. 

Section  2  presents  the  data  and  our  empirical  findings  in  more  detail.  The  provide  a  self-contained 
tour  of  the  empirical  literature  on  power  laws.  For  modularity's  sake,  we  chose  to  present  the  model  in 
two  steps.  In  Section  3,  we  present  the  model  while  assuming  the  microstructure  part,  i.e.  assuming 
a  power  law  price  impact.  We  show  how  his  generate  the  above  power  laws^.  In  Section  4,  we  present 
one  possible  model  for  the  square  root  price  impact  of  trade.  Section  5  gathers  the  out-of-sample 
predictions  of  our  model  and  our  empirical  results  for  some  of  these  predictions.  Section  6  discusses 
our  finding  in  the  context  of  the  relevant  literatures.  Section  7  presents  concluding  remarks  and 
directions  of  future  work. 

2     The  empirical  facts  that  motivate  our  theory 

2.1  Methodology  to  estimate  poAver  laws 

There  are  two  basic  methodologies  to  estimate  power  laws  exponents.  We  illustrate  them  with 
the  examples  of  returns.  In  both  methods,  one  selects  a  cutoff  of  returns,  and  orders  the  top  n 
observations  as  rn)  >  •  • .  ^  ^(ra)i  ^P  ^o  ^  certain  cutoff. 

The  simplest  method  is  to  do  a  "log  rank  log  size  regression" ,  where  an  estimate  of  C,  is  the  the 
OLS  coefficient  on  r(i)  in  the  regression  of  log  of  the  rank  i  (or  of  cumulative  frequency  i/n)  on  the 
log  size:  ^ 

Ini/n  =  A-  C°^'^ Inr(i)  +  noise 

This  method  is  the  simplest,  and  yields  a  visual  goodness  of  fit  with  the  power  law.  This  what  we 
do  for  instance  in  Figure  1. 

A  second  method  is  Hill's  estimator 

^Hill  _  n-1 

Er=i^(lnr(i)-lnr(„)) 

Those  two  methods  have  pitfalls.  In  particular,  they  underestimate  the  true  standard  errors.  Some 
of  those  pitfalls  are  discussed  in  Embrechts  et  al.  (1997,  330-345)  and  Gabaix  and  loannides  (2004). 
Most  of  the  available  statistical  literatrure  assumes  i.i.d.  draws.  In  contrast,  there  is  a  clear 
autocorrelation  of  volatility  and  trading  activity  in  financial  data.  This  does  not  bias  our  estimates 
of  the  unconditional  distributions,  but  makes  the  assessment  of  the  standard  errors  more  diflacult. 
Before  the  statistically  rigorous  theory  of  how  to  proceed  is  developed,  our  pragmatic  solution 
is  to  use  a  large  amount  of  data  -  several  million  observations.  The  stability  of  the  estimates 
across  different  periods,  countries  and  classes  of  assets  give  us  anyway  confidence  that  the  empirical 
estimates  we  report  here  are  robust. 

2.2  Normalizations  used  to  miake  different  assets  comparable 

To  compare  quantities  across  different  stocks,  we  normalize  variables  such  as  r,  V  and  A''  by  the 
second  moments  in  they  exist,  otherwise  by  the  first  moments^.  For  instance,  for  a  stock  i,  we 
consider  the  returns  ru  =  {ru  —  ri)  /ur^i-,  where  rj  is  the  mean  of  the  rn  and  cr^.i  is  their  standard 


^Technical  extension  of  the  present  paper  are  available  in  Gabaix  et  al.  (2003b). 
''This  is  explained  more  systematically  in  Appendix  F. 
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Figure  3:  Zipf  plot  for  the  daily  fluctuations  in  the  Nikkei  (1984-97),  the  Hang-Seng  (1980-97), 
and  the  S&P  500  (1962-96).  The  apparent  power-law  behavior  in  the  tails  is  characterized  by  the 
exponents  Cr  =  3.05  ±  0.16  (NIKKEI),  Cr  =  3.03  ±  0.16  (Hang-Seng),  and  Cr  =  3.34  ±  0.12  (S&P 
500).  The  fits  are  performed  in  the  region  \r\  >  1.  Source:  Gopikrishnan  et  al.  (1999). 


deviation.  For  the  volume,  which  has  an  infinite  standard  deviation,  we  consider  the  normalization: 
Qit  =  QitlQii  where  Qi  is  the  mean  of  the  Qn. 

2.3     The  cubic  law  of  price  fluctuations:  Cr  —  3 

The  cubic  distribution  of  returns  has  been  uncovered  in  a  series  of  studies  (Lux  1996,  Gopikrishnan 
et  al.  1999,  Plerou  et  al.  1999) .We  show  here  the  robustness  of  the  power-law  exponent  C,r  ~  3. 

It  seems  to  hold  internationally.  We  analyze  data  for  10  other  developed  stock  markets^.  We 
find  a  mean  of  C,r  =  2.9  with  a  standard  error  of  0.10.  Figure  3,  for  instance,  shows  how  the  return 
distribution  of  three  different  countries  are  very  similar. 

The  cubic  law  predicts  the  existence  of  a  few  very  large  returns  over  a  century.  In  Gabaix  et  al. 
(2003c)  we  show  that  in  the  U.S.,  stock  market  crashes,  including  the  1929  and  1987  crashes,  are  not 
outliers  to  the  cubic  law.  Hence  a  theory  of  the  cubic  law  of  returns  might  be  a  theory  of  crashes. 

Mandelbrot  (1963)  argued  that  financial  return  followed  a  L6vy  stable  distribution,  and  offered 
the  theory  that  this  might  come  from  a  large  number  of  shocks  with  infinite  variance.  Levy  distri- 
bution imply  that  C,r  <  2.  But  our  data  are  inconsistent  with  C,r  <  2,  hence  we  are  lead  to  reject 
Mandelbrot's  "stable  Paretian  hypothesis"  (Fama  1963). 

Having  checked  the  robustness  of  the  C  =  3  finding  across  different  stock  markets,  we  look  at 
firms  of  different  sizes.   Small  firms  have  a  higher  volatility  than  big  firms,  as  is  verified  in  Figure 


^We  took  the  stock  market  indices  of  Australia  (1/1/73-11/27/01)  (1/2/84-11/17/01),  Canada  (1/1/69-11/20/01), 
France  (1/2/84-H/27/01),  Germany  (1/1/73,  10/18/01),  Japan  (1/1/69-11/30/01),  Hong-Kong,  (1/1/73-11/30/01), 
Netherlands  (1/1/80-11/27/01),  South  Korea  (1/7/75-11/27/01),  Spain  (1/2/74-11/27/01),  and  United  Kingdom 
(12/25/68-11/20/01).  We  calculated  the  Hill  estimator  for  the  extreme  1%  of  the  distribution. 
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Figure  4:  Cumulative  distribution  of  the  conditional  probability  P{r  >  x)  of  the  daily  returns  of 
companies  in  the  CRSP  database,  1994-5.  We  consider  the  starting  values  of  market  capitalization 
K,  and  define  uniformly  spaced  bins  on  a  logarithmic  scale  and  show  the  distribution  of  returns  for 
the  bins,  K  e  (10^106],  K  G  (10^10■^],  K  €  (10^,10^],  K  G  (10^109].  (a)  Unnormalized  returns 
(b)  Returns  normalized  by  the  average  volatility  aK  of  each  bin.  The  plots  collapsed  to  an  identical 
distribution,  with  (^r  =  2.70  ±  .10  for  the  negative  tail,  and  C,t  =  2.96  ±  .09  for  the  positive  tail. 
Source:  Plerou  et  al.  (1999). 

4(a).  But  the  same  figure  also  shows  a  similar  slope.  Indeed,  when  we  normalize  the  distribution  by 
a  common  standard  deviation,  we  see  that  the  plots  collapse,  and  the  exponents  are  very  similar, 
around  C  =  3  again. 

Some  studies  have  quantified  the  power  law  exponent  of  foreign  exchange  fluctuations.  The  most 
comprehensive  is  probably  Guillaume  et  al.  (1997),  who  calculate  the  exponent  ^^  of  the  price 
movements  between  the  major  currencies.  At  the  shortest  frequency  At  =  10  minutes,  they  find 
exponents  with  average  (^r)  =  3.44,  and  a  standard  deviation  0.30.  This  tantalizingly  close  to  the 
stock  market  findings,  though  the  standard  error  is  too  high  to  draw  sharp  conclusions*. 

2.4     The  half  cubic  law  of  volume:  (q  ~  3/2 

TVying  to  understand  the  origins  of  the  cubic  law  for  returns,  Gopikrishnan  et  al.  (2000a)  looked 
at  the  distribution  of  volume^  in  the  U.S.  and  found  an  exponent  around  1.5:  Cq  =  1.53  ±  .07. 
The  same  value  ^,  ~  3/2  held  for  the  distribution  of  individual  trades.  Maslov  and  Mills  (2001) 
find  likewise  (^g  =  1.4  ±  .1  for  the  volume  of  market  orders.  To  test  the  universality  of  this  finding, 
we  examine  30  large  stocks  of  the  Paris  Bom-se  from  1995-2000,  which  contains  approximately  35 
million  records.  We  reported  the  finding  in  Figure  2  and  found  (^g  =  1.5  ±  .1. 


*  Suppose  returns  are  distributed  according  to  a  cubic  law  with  a  the  density  proportional  1/  (l  +  r^)  .  Examining 
the  top  1%  of  10,000  points,  the  standard  deviation  of  measured  exponents  is  about  0.4,  for  a  mean  of  about  3. 

'For  stock  i,  we  look  at  the  fluctuations  of  Vu/Vi,  where  Vi  is  the  mean  volume  Vu-  This  makes  stocks  comparable, 
and  the  "number  of  shares"  and  "dollar  volume"  would  give  equivalent  measures  of  volume. 


2.5     The  cubic  law  of  number  of  trades:  (^^  ~  3.3 

Plerou  et  al.  (2001)  find  an  exponent  (^  =  3.4  ±  .05  for  the  number  of  trades  in  a  15  min  interval  in 
the  U.S.  In  the  Paris  Bourse  data  mentioned  above,  we  find  (n  =  3.17  ±  .1.  Our  theory  will  predict 
the  exponent  of  3.  This  may  due  to  a  flaw  in  the  theory^",  or  to  simply  to  measurement  error^^. 


Normalized  ft gj 

Figure  5:  Cumulative  distribution  of  the  normalized  number  of  transactions  n^t  =  -^At/  (-^At)  ■  Each 
symbol  shows  the  cumulative  distribution  P  (tia*  >  x)  of  the  normalized  number  of  transactions 
riAt  for  all  stocks  in  each  bin  of  stocks  sorted  according  to  size.  An  analysis  of  the  exponents 
obtained  by  fits  to  the  cumulative  distributions  of  each  of  the  1000  stocks  yields  an  average  value 
^^  =  /3  =  3.40  ±  0.05. 


2.6     The  power  law  distribution  of  the  size  of  large  investors:  Cs  —  1 

The  size  distribution  of  many  entities  follows  a  power  law,  and  often  with  exponent  1,  something 
called  a  Zipf's  law.  This  is  true  for  cities  (Zipf  1949,  Gabaix  and  loannides  2003)  and  business 
firms.  Indeed  Axtell  (2001)  and  Okuyama  et  al.  (1999)  have  shown  that  the  distribution  of  firms 
(respectively  in  the  US  and  Japan)  follows  Zipf's  law.  As  they  show  that  the  distribution  of  firms 
follows  Zipf's  law,  it  is  tempting  to  hypothesize  that  the  distribution  of  firms  that  manage  money 
follows  Zipf's  law.  We  found  data  for  an  important  subset  of  those  firms,  mutual  funds'^.  From 
CRSP  one  gets  the  size  (dollar  value  of  the  assets  under  management)  of  all  the  mutual  funds'^ 


'"A  scaling  N  ~  V^  gives  a  value  (^n  =  Cv/C>  so  that  a  scaling  of  the  number  of  trades  N  ~  yO-44  j-gjijej.  that  the 
theoretical  N  ~  V"-^  would  correct  this  misfit.  This  would  mean  that  the  broker  can  do  slightly  better  than  predicted 
by  the  model.  For  very  large  trade,  the  broker  could  have  an  extra  incentive  to  limit  the  number  of  people  to  whom 
the  trade  is  offered,  perhaps  to  limit  information  leakage. 

^  ^  Power  law  relations  are  notoriously  hard  to  measure,  especially  for  positive  variables  with  high  exponents  with 
positive  quantities.  The  reason  is  that  high  exponents  have  fewer  very  large  events,  so  the  measurement  can  be 
affected  by  other  sources  of  noise.  For  instance,  suppose  that  the  true  process  for  the  number  of  trades  Af  is  JV  = 
A^o  +  a,  where  No  has  power  law  3  with  P(No  >  x)  =  kx~^,  and  a  is  a  constant.  The  interpretation  is  that  a 
is  a  constant  "background  noise"  number  of  trades,  and  the  No  represents  the  trades  of  large  investors  as  in  our 
model.  Then  the  CDF  of  N  is  G{x)  =  P{N  >  x)  =  k{x  —  a)~^,  and  the  measured  power  law  exponent  will  be: 
Cn  =  —xG'  (x)  /G  (i)  =  /  (1  —  a/x).  It  tends  to  3  for  very  large  values  of  x,  but  in  finite  samples,  there  is  an  upward 
bias  equal  to  a/No,  where  No  is  the  order  of  magnitude  of  typical  value  of  No  in  the  sample. 

'"Here  we  sketch  the  main  findings.  Gabaix,  Ramalho  and  Renter  (2003)  present  many  more  details. 

'^The  say  200  funds  of  Fidelity,  for  instance,  count  as  200  different  funds,  not  as  one  big  "Fidelity"  fund. 
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from  1961-1999.  For  each  year  t,  we  do  an  OLS  estimation  of  the  power  law  exponent  C,  of  the 
distribution.  We  find  an  average  coefficient  (Q)  —  1.10,  with  a  standard  deviation  across  years^** 
of  0.08.  The  Hill  estimator  technique  gives  a  mean  estimate  (Ct)  =  0.90  and  a  standard  deviation 

IQ  —  (^t)  \  =  0.07.  All  in  all  we  conclude  that,  to  a  good  approximation,  mutual  fund  sizes 
follow  a  power  law  distribution  with  exponent: 

Cs  ^  1  (7) 

For  the  purpose  of  this  paper,  one  can  take  this  distribution  of  the  sizes  of  mutual  funds  as 
a  given.  It  is  in  fact  not  difficult  to  explain.  One  can  transpose  the  explanations  given  for  cities 
(Gabaix  1999,  and  the  references  in  Gabaix  and  loannides  2004)  to  mutual  funds.  A  log  normal 
process  with  small  perturbation  to  ensure  convergence  to  a  non-degenerate  steady-state  distribution, 
explains  the  power  law  distribution  with  an  exponent  of  1.  Gabaix,  Ramalho  and  Reuter  (2003) 
develops  this  idea  and  shows  that  those  assumptions  are  verified  empirically. 

It  is  only  recently,  say  in  the  past  30  years,  that  mutual  funds  have  come  to  represent  a  large 
part  of  the  marketplace.  For  the  earlier  periods,  the  theory  will  work  if  the  distribution  of  large 
agents  still  follow  Zipf's  law  with  a  unit  exponent.  We  do  not  have  direct  evidence  for  this,  but  a 
very  natural  candidate  would  be  the  pension  funds  of  corporations.  It  is  very  likely  that  the  size  of 
the  pension  fund  of  a  firm  with  S  employees  is  proportional  to  S',  so  that  their  pension  funds  also 
follow  Zipf's  law. 

3     The  theory,  assuming  a  power  law  price  impact  of  trades 
3.1     Sketch  of  the  theory 

Assuming  a  power-law  functional  form  for  price  impact,  we  will  show  how  the  power  law  distribution 
of  large  investors  generates,  through  intelligent^^,  tactical  behavior  of  the  traders,  the  exponents 
Cq  =  C?  =  3/2  for  the  volume,  and  ^7.  =  Cw  =  3  for  returns,  volumes  and  number  of  trades.  The 
origins  of  the  power  law  distribution  of  large  investors  are  fairly  well  understood^® .  In  Section  4,  we 
will  provide  an  explanation  for  the  power  law  price  impact. 

In  our  theory,  large  volumes  and  large  returns  are  created  by  trades  made  by  large  agents 
(investors).  When  a  large  agent  wants  to  trade  with  reasonable  speed,  he  moves  the  market.  Trading 
slowly  decreases  the  direct  price  impact,  but  has  other  drawbacks,  as  the  profitability  of  the  trading 
opportunity  may  diminish  in  part  due  to  information  leakages.  We  will  show  that  optimal  trading 
by  large  investors  generates  the  exponents  ^^  =  Ca^  =  3  and  (^,  =  3/2. 

Figvure  6  displays  the  modular  structure  of  the  paper.  It  indicates  the  logical  relations  of  the 
different  parts  of  om  theory.  Its  different  parts  are  fairly  independent,  which  confers  some  robustness 
to  our  theory.  For  instance,  even  if  our  explanation  for  the  square  root  price  impact  Ap  ~  V'^l'^ 
is  erroneous,  the  rest  of  the  paper  still  holds.  The  relations  indicated  in  the  Figure  6  are  those 
articulated  in  the  paper,  except  for  the  link  between  Gibrat's  law  and  Zipf's  law,  which  comes  from 


^^  We  cannot  conclude  that  the  standard  deviation  on  our  mean  estimate  is  0.08  (1999  —  1961  -I- 1)~'®.  The  estimates 
are  not  independent  across  years,  because  of  the  persistence  in  mutual  fund  sizes. 

^^Our  traders,  though  intelligent  (they  try  to  avoid  too  high  trading  costs),  are  not  necessarily  hyperrational  i.e., 
they  may  trade  too  much,  as  in  Daniel  et  al.  (1999)  and  Odean  (2000). 

'^See  Gabaix  (1999),  Gabaix,  Ramalho  and  Reuter  (2003),  and  Gabaix  and  loannides  (2004),  and  the  references 
therein. 
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Figure  6:  The  modular  structure  of  arguments  in  this  paper 


Gabaix  (1999),  Gabaix  and  loannides  (2004)  and  the  references  therein.  It  is  logically  independent 
of  the  paper,  which  is  why  the  link  is  a  dashed  line. 

3.2     Notations  and  some  elementary  links  between  exponents 

3.2.1     Notations 

We  call  C,x  the  power  law  exponent  of  a  random  variable  X,  i.e.  the  number  such  that  for  a  large 
real  x,  P  {X  >  x)  ~  a;~^-^ .  If  all  moments  of  X  are  finite,  the  convention  is  ^x  =  +oo. 

Suppose  a  large  investor  wishes  to  trade  a  large  volume  V.  The  large  investor  divides  V  into  n 
parts  of  sizes  gi,  ■■■,q„,  so  that  V  =  Yll^i  Qi-  ^^  ^^^^  ^P  ^^^  price  impact  due  to  the  entire  trade. 
Over  a  given  time  interval,  we  call  r  the  return,  Q  the  aggregate  volume,  and  N  the  number  of 
trades.  The  variables  Q,  qi,  N,  r  are  directly  measurable  while  V,  n,  and  Ap  are  not.  We  have: 


j  traded  in  that  interval 

Q=         E         ^^ 

j  traded  in  that  interval 


71, 


(8) 

(9) 
(10) 


j  traded  in  that  interval 

where  u  reflects  other  sources  of  price  movement,  such  as  news. 
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3.2.2  Link  between  aggregate  £ind  trader-based  exponents 

Our  generates  power  law  behaviors  for  the  distribution  of  'unobservable'  variables  such  as  V,  n 
and  Ap  in  (8  )-(10).  As  we  show  in  the  proposition  below,  we  relate  the  power-law  exponents  for 
measurable  quantities  to  the  unobservable  variables,  e.g.,  we  show  that  the  measurable  power  law 
exponent  C,q  =  C,v- 

Proposition  1  We  make  the  maintained  assum/ption  that  the  Vj  are  independent,  that  the  Uj  are 
independent,  and  Apj  and  u  are  independent,  and  that  ^Ap  <  Cu  •   Then 

C.  =  Cap  (11) 

Cq  =  Cv  (12) 

Civ  =  Cn  (13) 

Proof.  We  use  results  (55)  from  Appendix  A.   ■ 

Assumption  ^Ap  <  Cu  means  that  the  power  law  of  retirrns  is  due  to  the  feattures  of  our  model, 
not  to  news.  The  Proposition  that  the  sum  of  power  laws  with  the  same  exponent  ^  is  also  power 
law  with  the  same  exponent  C,  is  very  general  (see  Appendix  A).  In  particular,  our  results  will  hold 
under  much  milder  assumptions  of  independence;  as  shown  in  Appendix  B,  they  hold  even  if  there  is 
autocorrelation  in  market  liquidity  and  trade  arrival.  For  the  sake  of  clarity,  we  present  our  results 
under  the  simplest  assumption  of  independence. 

Next,  we  discuss  an  equally  simple  but  more  substantive  link. 

3.2.3  Link  between  the  exponents  of  return  and  volume 

A  large  literature  explores  the  relation  between  a  trade  of  size  V  and  a  price  impact  r  =  f{V). 
The  initial  finding  of  Hasbrouck  (1991)  is  an  increasing,  concave  price  impact.  This  is  usefully 
parametrized  by  a  power  law  for  large  volumes  V: 

Ap  =  hV  (14) 

The  following  Proposition  links  the  exponents  of  volume,  return  and  price  impact. 

Proposition  2  Suppose  that  we  have  a  power  law  distribution  of  volumes  with  exponent  ^q  and  a 
power  law  price  impact  (14)  with  exponent  7.  Then  we  have  a  power  law  distribution  of  returns  with 
exponent: 

Cr  =  Cq/7- 

Proof.  As  Ap  =  hV"', 

P{Ap>x)=P  {hV  >x)=p(y>  {x/hf''^ 


(ix/hf^) 


so  Cap  =  ^qH-  We  use  Proposition  1  to  conclude.    ■ 

Thus  the  empirical  evidence  Cq  =  3/2  and  (^^  =  3  is  consistent  with  a  price  impact  exponent 
7  =  1/2.  We  will  give  empirical  and  theoretical  support  for  7  =  1/2  in  Section  4.  In  the  rest  of  this 
Section,  however,  we  assume  a  general  7. 
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3.3     A  main  result 

We  start  with  the  following  puzzle.  If  each  fund  i  of  size  Si  traded,  at  random,  a  volume  Vi 
proportional  to  Si,  Vi  =  aiSi,  then  the  distribution  of  volumes  would  follow  (^y  =  Cs-  Then  by  (7) 
and  Proposition  1,  we  would  have  (^q  ~  1,  in  contrast  to  the  empirical  value  (^g  ~  3/2. 

The  larger  value  of  ^q  compared  to  (^s  means  that  distribution  of  volumes  is  less  fat-tailed  that 
the  distribution  of  size  of  investors.  This  means  that  large  traders  trade  less  often  than  small  traders, 
or  that,  when  they  trade,  they  trade  in  volume  less  than  proportional  to  their  sizes.  How  can  we 
understand  this  behavior? 

Intuitively,  a  likely  reason  is  that  large  traders  have  large  price  impacts,  and  have  to  moderate 
their  trading  to  avoid  paying  too  large  costs  of  price  impacts.  This  suggest  that  an  important 
quantity  is 

,  Annual  amount  lost  by  the  fund  in  price  impact 

Value  S  of  the  assets  under  management  of  the  fund 

For  example,  if  funds  of  size  S  pay  on  average  1%  in  price  impact  a  year,  c  (5)  =  1%. 
The  above  arguments  motivate  the  following  theorem  which  constitutes  a  central  result  of  this 
paper: 

Theorem  3  If  the  following  conditions  hold: 

(i)  The  size  (assets  under  managements)  of  large  financial  market  participants  follows  a  power 
law  with  exponent  Cs; 

(a)  A  large  volume  V  has  a  price  impact: 

Ap  ~  V^  (15) 

with  7  >  0; 

(Hi)  Funds  trade  in  volumes 

V^S^  (16) 

for  some  d  >  0; 

(iv)  Funds  adjust  trading  frequency  and/or  volume  so  as  to  pay  an  amount  transaction  costs, 
relative  to  their  assets: 

c{S)  =  C  (17) 

(survival  constraint); 

Then  returns  and  volumes  are  power  law  distributed  with  the  exponents: 

C.  =  l+-  +  ^^  (18) 

7  70 

Cq  =  1  +  7+^^  (19) 

We  state  a  useful  Corollary. 

Corollciry  4  In  additions  to  the  assumptions  of  Theorem  3,  assume:  (i)  Zipf  's  law  for  investors, 
^5  =  1,  and  (ii)  the  square  root  price  impact,  Ap  ~  1/V2  ^j_g_  j  —  \/2),  then  returns  and  volumes 
are  power  law  distributed  with  with  exponents  resp.  3  and  3/2: 

Cr=3 

(:q  =  3/2. 
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We  view  Theorem  3  and  its  Corollary  4  as  describing  the  essential  features  of  large  movements 
in  market  activity.  The  power  law  distribution  of  returns  and  volumes  comes  from  the  power  law 
of  large  investors  (Condition  i)  and  the  power  law  impact  of  trades  (Condition  ii).  Traders  want  to 
avoid  paying  too  much  in  price  impact  (Condition  iv),  which  moderates  those  power  laws. 

Corollary  4  is  relevant,  as  empirically  it  seems  that  ^5  ~  1  and  7  ^  1/2.  Section  4  will  discuss 
reasons  for  7  =  1/2.  In  general,  the  fatter  the  tails  of  the  distribution  of  large  agents,  the  fatter 
the  tails  of  the  distributions  of  volumes  and  returns.  This  may  be  relevant  in  highly  regulated  or 
immature  markets,  which  might  not   have  (^5  ~  1. 

We  comment  on  the  hypotheses  before  proceeding  to  the  proof.  We  have  discussed  Condition 
(i)  in  Section  2.6.  The  distribution  of  many  other  social  entities,  such  as  firms,  cities  and  invidual 
wealth,  follows  power  law  distributions. 

Condition  (ii)  is  consistent  with  most  empirical  work  (Hasbrouck  1991,  Hasbrouck  and  Seppi 
2002,  Plerou  et  al.  2002,  Lillo  et  al.  2003).  We  endogenize  it  in  Section  4. 

Condition  (iii)  is  a  weak  technical  assumption  which  we  endogenize  in  Section  3.4.  Condition 
(iii)  can  be  further  weakened  considerably.  The  result  would  still  hold,  up  to  logarithmic  corrections, 
if  we  had  V  (5)  ~  exp  [aS"")  for  positives  a,  a.  Condition  (iii)  assumes  that  V  (5)  "fast  enough"  in 
5,  i.e.,  more  rapidly  than  logarithmic. 

Condition  (iv)  means  that  funds  in  the  upper  tail  of  the  distribution  pay  roughly  similar  annual 
price  impact  costs  c  (S)  reaches  an  asymptote  for  large  sizes.  We  provide  a  foundation  for  this  in 
Section  3.4.  We  interpret  condition  (iv)  as  an  evolutionary  "survival  constraint".  Funds  that  would 
have  a  very  large  c(5)  would  have  small  retinrns  and  would  be  eliminated  from  the  market.  The 
average  return  r  {S)  of  funds  of  size  S  is  independent  of  S  (Gabaix,  Reuter  and  Ramalho  2002). 
Since  small  and  large  funds  have  similarly  low  ability  to  outperform  the  market  (see  e.g.  Carhart 
1997),  c{S)  is  also  independent  of  S. 

We  now  proceed  to  the  proof  of  the  proposition. 

Proof.  Each  time  an  investor  trades,  he  incurs  a  price  impact  proportional  to  VAp.  Given  (ii), 
this  cost  is  hV^'^'^ .  If  F  [S)  is  the  an  annual  frequency  of  trading,  the  total  annual  dollar  lost  in 
transactions  costs  is  F  (5)  h[V  {S)Y'^'',  i.e.  a  fraction 

c{S)  =  F{S)h[V{S)]^+"'/S  (20) 

of  the  value  S  of  this  portfolio.  Hypothesis  (iii)  gives  that  V  {S)  and/or  F  (5)  will  adjust  so  as  to 
satisfy: 

F(5)-5-y(5)-'^+^^  (21) 

By  condition  (i)  there  is  a  number  G  {S)  ~  S~''^  of  traders  of  size  greater  than  x  .  So  the  density 
of  traders  of  size  S  is  p{S)  =  —G'  (5)  ~  S~^~^^ .  They  trade  with  frequency  F  (5)  given  in  (21). 
Volumes  V  >  x  correspond  to  traders  of  size  5'  such  that  S^  >  x.  Putting  all  this  together: 

P{V>x)^  [       F{S)p{S)dS^  f         5^-(^+^^*5-^-'^^d5. 

^    [_5-(l+7)5-Cs  +  ll°°        ^3.-(l+7)-(Cs-l)/i5 

which  gives  (19).  The  value  of  (^^  comes  from  this  and  Proposition   2.    ■ 
One  can  state  a  result  independent  of  a  particular  value  of  7. 

Corollary  5   (Reciprocity  law  between  the  exponents  of  returns  and  volume).   Under  the  assumptions 
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of  Theorem  3,  assuming  also  ^5  =  1  (Zipf's  law  for  large  investors),  one  has,  independently  0/7; 

Proof.  This  comes  from  direct  calculation  from  (18)  and  (19): 

1        1    _         1  1       _ 

C.^Cq  "  1  +  1/7^  1  +  7  ~   ■ 

■ 

The  "reciprocity  law"  (22)  is  likely  to  be  particularly  robust,  as  it  does  not  depend  on  a  specific 
value  of  7.  It  is  verified  for  our  empirical  values  ^r  =  3,  Cq  =  3/2. 

3.4     An  optimizing  model  that  illustrates  Theorem  3 

It  is  useful  to  illustrate  the  "bare-bones"  model  in  Theorem  3  by  an  economic  model.  To  this  end, 
we  introduce  a  motivation  to  trade  and  a  reason  for  why  large  funds  moderate  their  price  impact 
as  in  condition  (iii).  For  the  motivation  for  trade,  we  suppose  that  fund  managers  perceive  i.i.d. 
mispricings  that  are  independent  across  different  funds.  We  will  consider  a  world  made  of  K  assets 
indexed  by  z,  but  the  intuition  works  for  K  =  1  asset.  Suppose  a  manager  believes  that  there  is 
a  mispricing  Mj,  i.e.  that  the  asset  will  have  excess  returns  of  size  Mj.  Without  loss  of  generality, 
we  here  take  Mi  >  0,  so  that  the  manager  want  to  buy  the  asset.  Suppose  that  the  fund  managers 
trade  on  perceived  mispricings  which  occur  them  Fi  <  F^^^^  times  a  year^^.  Given  a  mispricing  Mj, 
and  a  volume  Vi  invested  in  it,  the  dollar  profit  from  a  trade  of  size  Vi  is  the  mispricing  in  excess  of 
the  transaction  cost:  Vi  {Mi  —  Ap^),  where  the  price  impact  Ap,  is 

Api  =  hiV;j  .  (23) 

In  the  condition  (iii)  of  Theorem  3:  Traders  do  not  wish  to  lose  more  than  C%  of  their  funds 
in  transaction  costs.  Intuitively,  this  arises  because  if  they  follow  their  instincts  to  trade  too  much, 
their  returns  may  be  too  small.  To  capture  this,  we  assume  that  the  trader  believes  with  probability 
77  the  mispricings  are  real,  while  with  probability  1  —  tt  the  mispricing  is  a  false  perception.  If  the 
trader's  estimated  mispricings  are  reliable  his  expected  returns  are 


r+  =  -^X^F,l^i(M-ApO 


^^ Before  we  proceed,  we  comment  on  possible  interpretations  of  Mi.  The  simplest  is  that  the  trader  thinks  that 
the  market  price  is  wrong.  There  is  a  lot  of  evidence  that  investors  do  not  have  all  the  same  assessment  of  the  fair 
valuations,  which  give  rise  to  the  perceived  mispricing  Mi.  Bagwell  (1992)  offers  early  evidence.  Diversity  in  forecasts 
offers  direct  evidence  for  this,  and  the  very  high  trading  volume  offers  strong  indirect  suggestion  of  heterogenous 
valuations.  There  is  evidence  for  return  predictability  (Barberis  and  Thaler  2003,  Hirshleifer  2001,  Shleifer  2000)  - 
so  mispricings  get  progressively  corrected,  captured  in  our  model  by  the  fiT  term.  A  second  interpretation  for  the 
trading  motive  Mi  is  that  a  given  point  in  time,  the  investor  may  hold  his  optimum  portfolio.  The  difference  between 
his  current  portfolio  and  his  optimum  portfolio  costs  him  fj,  per  unit  of  time,  and  is  proportional  to  the  dollar  amount 
at  stake,  V .  Hence  taking  T  units  of  time  to  realize  his  trade  costs  him  ^iTV .  So,  the  derivations  in  the  paper  go 
through  with  that  interpretation.  We  now  proceed  to  examine  the  tradeoff  that  the  manager  faces  between  taking 
advantage  of  perceived  opportunites  M  and  the  trading  costs. 
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But  if  his  intuitions  are  wrong,  and  markets  are  efficient,  his  returns  are: 

i 

The  traders  has  an  objective  function  U  on  returns,  so  that  his  expected  utihty  is: 

W  =  TrUr,r{r+)  +  {l-7r)Ur,r{r-).  (24) 

We  will  consider  the  class  of  utility  functions: 

Ur,r  {r)  =u{r  +  r)-  Ze'^'^^'+^l 

for  some  positive  Z,  T  and  some  increasing  weakly  concave  function  u.  Ur,r  is  increasing  and  concave 
and  satisfy: 

lim  Ur  r  (r)  =  u  {r  +  r)  if  r  +  r  >  0 
r— »oo      ' 

=  —CO  if  r  +  r  <  0 

In  other  terms,  the  trader  has  a  standard  utility  function  for  returns  r  above  a  threshold  —r,  but 
wants  to  avoid  getting  returns  below  —r.  We  can  think  of  — r  <  0  as  some  low  underperformance 
threshold  below  which  the  fund  managers  lose  their  job.  F  is  an  aversion  to  underperformance. 
We  state  the  Proposition. 

Proposition  6  In  the  setup  above,  suppose  that  the  "cost  cap "  r  is  low  enough,  so  that  the  survival 
constraint  (iv)  in  Theorem,  3  holds  (c.f.  Eq.  (S3)).  Then,  as  the  aversion  to  underperformance  T 
become  very  large,  hypothesis  (ii)  and  (Hi)  of  the  Theorem  3  hold  for  each  stock  i,  with 

5  =  1/(1  +  7).  (25) 

The  size  of  trades  is: 

y,  =  fc  T:^ y  %V(i+7)  (26) 

for  a  constant  k  given  in  (34)-  The  total  amount  spent  in  transaction  costs  is  the  value  of  the 
portfolio  times  r. 

Thus  this  economic  model  generates  the  conditions  (iii)  and  (iv)  of  Theorem  3.  It  makes  a 
prediction  (25)  on  the  value  of  5  that  could  in  principle  be  tested.  The  volume  traded  increases  with 
the  size  of  the  trader,  but  a  in  less  than  linearly.  The  elasticity  9  In  V/91n  5  =  1/(1+7)  is  higher 
in  markets  in  markets  with  low  eleisticity  of  price  impact  7. 

We  also  see  that,  in  term  of  Theorem  3,  if  the  constraint  (iv)  holds  for  the  whole  portfolio,  it 
will  hold  stock  by  stock  too. 

It  was  expected  that,  £is  in  (26),  funds  trade  in  larger  volume  when  they  are  bigger,  when  the 
perceived  profit  opportunities  Mj  are  larger,  and  the  prefactor  hi  of  price  impact  is  small. 

Proof.  We  study  directly  the  case  F  =  00,  as  the  problem  is  continuous  in  F.  The  trader's 
problem  is: 

K 

max^Fiy,(Mi-Api)  (27) 

"    '  i=l 
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K 


s.t.   ^FiVi^p,<rS,  (28) 

i=l 

Fi  <  F^""^"" ,  and  (29) 


The  Lagrangian  is: 


£  =  ^  F,  Vi  (Mi  -  Ap,)  +  A  f  r5  -  ^  F,  V, Ap,  J  +  X^  ac,  (i^"^''  -  F) 

i  \  i  /  i 


and  the  first  order  conditions  are: 

^||=M.-(l  +  A)W-^=0  (30) 

J;||=M,-(l  +  A)(l  +  7)/i,V,^  =  0  (31) 

Eq.  (31)  gives  Vi  >  0,  and  (30)-(31)  give  Ki  =  (1  +  A)7/iiyj  >  0,  i.e.  (29)  binds.  So: 

VT  =  Mi l-io) 

'        (l  +  A)(l  +  7)/ii  ^^ 

The  left-hand  side  of  (28)  is: 

H  (A)  =  [(1  +  7)  (1  +  A)]-'-^/^  Yl  Fr^Ml'^^'-'h-^'^ 

i 

and  (28)  binds  iflf  H  (0)  >  rS,  i.e. 

K  |(l+7)-'-'/^E^'""^/^'^X'^^  (33) 

As  we  assume  this,  A  solves  H  (A)  =  r5.  With  (32),  this  yields  (26)  where: 

n  1/(1+7) 
I 

The  total  cost  constraint  for  stock  i  is  Cj  =  F^^hiV^'*"' /S,  i.e. 


k  = 


^.jnnax^j  +  1/7^-1/7- 


In  particular  ^^  Cj  =  r. 


(34) 
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4     The  square  root  of  price  impact  of  trades:  Evidence  and  a 
possible  explanation 

At  this  stage,  we  understand  how  a  power  law  price  impact  of  trading,  and  a  power  law  distribution 
of  traders  sizes,  gives  rise  to  power  law  distributions  of  volume  and  returns.  We  now  explore  why  a 
particular  value  of  the  price  impact,  7  =  1/2,  is  favored  on  empirical  and  theoretical  grounds.  The 
initial  motivation  is  Proposition  2,  which  shows  that  a  square  root  impact  7  =  1/2  precisely  gave 
the  right  link  between  Cr  =  3  and  (^q  =  3/2.  So  we  first  present  empirical  evidence  that  supports 
7  =  1/2,  and  then  propose  a  model  that  can  explain  this  particular  value. 

4.1     Evidence  on  the  squcire  root  la^v  of  price  impact 

We  provide  evidence  here  for  the  7  =  1/2  in  (14),  i.e.  a  price  impact  function: 

Ap  ~  yi/2  (35) 

We  start  from  the  benchmark  where,  in  a  given  time  interval,  J  i.i.d.  blocks  are  traded,  with  volumes 

Vi, ...,  Vj,  with  i.i.d.  signs  Sj  =  ±1.  The  aggregate  volume  is  Q  =  J2j=i  ^71  ^'^'^  *^^  aggregate  return 

is: 

J 

where  u  is  some  other  somce  of  price  movement,  assumed  to  be  uncorrelated.  Then, 


[r^\Q\=al+h''E 

J2Vj  +  J2^jekVjV,\Q 
3            iitk 

=  cyl  +  h'Q- 

fO. 

We  gather: 


E  [r^  I  Q]  =  ^^  +  h-'Q  (36) 


Figure  7  shows  the  empirical  result'^.  It  reveals  an  affine  relation,  rather  than  any  clear  sign  of 
concavity  or  convexity.  A  formal  test  that  we  detail  in  Appendix  F  confirms  this.  We  thus  name 
(35)  as  a  good  candidate  for  an  empirical  law,  and  now  propose  a  theory  for  it. 

4.2     Sketch  of  oxir  explanation  for  the  square  root  price  impact 

We  now  present  a  sketch  of  our  explanation  for  (35)  before  developing  it  further.  The  basic  mi- 
crostructure  models  (e.g.  Kyle  1985)  give  a  linear^^  price  impact  Ap  ~  V.  Virtually  all  the  empirical 
evidence  (starting  with  Hasbrouck  1991),  however,  shows  a  concave  price  impact.  Put  succinctly, 
the  argument  is  the  following.  In  our  model,  a  trader  who  is  willing  to  wait  for  an  amount  of  time 


'*The  construction  of  the  confidence  interval  requires  special  care,  as  r^  has  infinite  variance. 

'^A  concave  price-volume  is  given  by  Seppi  (1990),  Barclay  and  Warner  (1993),  and  Keim  and  Madhavan  (1996). 
These  models  do  not  predict  a  square  root,  however. 
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Figure  7:  Conditional  expectation  E  [r^  |  Q]  of  the  squared  return  r^  given  the  volume  Q.  The  bands 
represent  95%  confidence  intervals.  The  theory  predicts,  for  large  Q's,  a  relation  E'lr^lQ]  =  a  +  PQ, 
the  "square  root"  price  impact  of  volume,  r  is  in  units  of  standard  deviation,  and  Q  in  units  of  the 
first  moment.  The  time  interval  is  At  =  15  min,  and  the  results  are  averaged  over  N  =  116  liquid 
stocks.  Formal  tests  reported  in  Appendix  F  show  that  one  cannot  reject  iJ[r^|(3]  =  a  +  PQ  large 
enough  (Q  >  3). 


T  to  realize  his  buy  trade  will  incur  a  price  impact: 

V 


(37) 


so  that  Kyle  (1985)'s  linearity  of  the  price  impact  in  V  holds,  but  for  a  given  T.  But  more  patient 
traders  who  are  willing  to  wait  for  a  larger  time  T,  will  be  have  smaller  price  impact.  Suppose  that 
asset's  price  is  increasing  at  a  rate  ji.  The  total  benefit  from  the  trade  of  size  V  will  be: 

B  =  V{M  ~iiT-Ap) 

The  optimal  Ap  maximizes  B  subject  to  (37),  so  that 


which  gives 


/  V 

Ap  =  arg  max  V  [  M  —  a— Ap 

Ap        \  Ap 


Ap  =  {^iVf'^  ~  v"^ 


and  T  ~  V^/^.  In  words,  the  price  impact  of  a  trade  of  size  V  scales  like  less  than  linear  (Ap  ~  V) 
because  large  traders  are  willing  to  wait  for  a  longer  time  T  to  moderate  their  price  impact.  In  other 
words,  in  order  to  execute  a  trade  of  size  y,  traders  are  on  the  average  more  patient  and  wait  for  a 
time  T  ~  V^/'^,  so  that  Eq.  (37)  gives  Eq.  (35).  We  now  proceed  to  the  model  that  gives  (37). 
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4.3     A  microstructure  model  for  the  power  law  impact  of  a  block 

We  will  present  a  search  model  for  the  way  in  which  a  trader  finds  counterparts  for  a  large  trade. 
Our  model  is  a  search  model.  Other  models  include  He  and  Wang  (1995),  Saar  (2001),  Vayanos 
(2001),  Duffie,  Garleanu  and  Pedersen  (2002). 

4.3.1  The  behavior  of  liquidity  providers 

We  will  present  a  model  where  a  seller  of  size  Sj  who  received  from  the  buyer  a  price  concession  Ap 
is  willing  to  supply  a  number  of  shares 

qi  =  asi  \Ap\  (38) 

where  a  is  some  proportionality  factor.  This  linear  supply  could  be  the  linearization  of  a  host  of 
models.  We  present  one  such  model  in  Appendix  D. 

4.3.2  Permanent  vs  Trcinsitory  price  impact 

In  most  microstructure  analyses,  the  price  impact  has  a  temporary  and  a  permanent  component. 
The  typical  implementation  is: 

Assumption  A  on  the  temporary  and  permanent  price  impact.  If  the  mid-quote  price  before  the 
trade  was  Pt-i,  and  the  transaction  price  was  pj'^<^<^<^  =  pj_-^  -f  Apt,  then  the  next  mid-quote  price 
is  on  average  pt+i  =  Pt-i  +  bApt  with  b  €  (0, 1). 

Various  ways  to  model  this  are  offered  in  the  literature  (Biais  et  al.  2002,  O'Hara  1995).  Here 
weadopt  a  very  simple  justification  for  this.  Our  results  would  not  change  if  bt  had  a  seasonal  (e.g., 
time  of  the  day)  component,  or  was  random  with  power  law  C6  >  3. 

Proposition  7  Suppose  that  in  the  model  there  are  risk  neutral  arbitrageurs  who  think  that,  when 
a  traded  price  differs  from  the  previous  quoted  price,  the  trade  comes  from  probability  b  from  an 
informed,  risk  neutral  trader,  and  with  probability  1  —  6  from  uninformed  trader.  Then,  they  will  will 
enforce  Assumption  A  above. 

Proof.  Call  ft  the  fundamental  price,  J-'t  the  information  at  time  i,  and  It  =  1  ii  the  trade  is 
informed  and  0  otherwise,  so  that  b  =  P  {It  =  1).  Arbitrageurs  impose  pt-i  =  E  [/t-i  |  Tt-i],  and 
the  after-trade  price  is: 

p«+i  =  Et  [ft+i  I  pj''""'  and  Tt-i] 

=  bEt  [ft+i  I  p^"''^  h  =  1  and  Tt-i]  +  (1  -  6)  Et  [ft+i  \  pj'^"',  h  =  1  and  Tt-i\ 
=  6(pt_i  +  Ap)  +  (l-6)pt_i 
=  Pt-i  +  bAp. 

■ 

The  above  justification  is  not  particularly  interesting,  as  it  does  not  make  any  prediction.  Its 
only  merit  is  to  close  our  model  in  a  simple  way. 

4.3.3  The  behavior  of  the  large  trader 

We  consider  a  large  trader  A  who  wants  to  buy  a  large  number  of  shares  V.  To  accomplish  the 
trade,  the  trader  will  search  for  counterparts.    When  all  the  counterparts  are  found,  the  trade  is 
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executed,  i.e.  the  trades  with  the  counterparts  are  executed  at  the  same  time.  Our  description  is  a 
stylization  of  the  "upstairs"  market,  and  is  in  the  spirit  of  other  search  models  in  the  literature. 

At  time  t  =  0  trader  A  decides  to  buy  a  quantity  V  and  looks  for  a  counterpart.  Liquidity 
providers,  indexed  by  i,  are  found  stochastically^''.  Because  of  (38),  liquidity  providers  are  willing 
to  sell  the  active  trader  a  quantity  of  shares  Qi  =  aSiAp.  So  after  a  time  i,  the  active  trader  A  can 
buy 

^  asiAp. 

traders  i  that  have  appeared  between  0  and  t 

The  search  process  stops  when  the  desired  quantity  is  reached,  i.e.  at  a  time  T  such  that^^ 

Y^  asiAp  =  V  (39) 

i  arrives  between  0  and  T 

Let  /  denote  the  (Poisson^^)  frequency  of  arrival  of  liquidity  providers,  s  their  average  size,  and  the 

average  time  for  realizing  the  trade  he  T  =  E  \T  \  Ap,  V\.  Taking  expectations  on  (39),  we  obtain 

fTasAp  =  V,  and: 

V 
T  =  -T^^r-  (40) 

ajsAp 

The  trader's  goal  is  to  minimize  two  quantities:  the  execution  time  T,  and  the  execution  cost 
|Ap|.  From  above,  his  objective  function  for  the  given  trade  is: 

B=-V{M  -fiT-Ap)  (41) 

Indeed,  as  above,  the  trader  expects  the  asset  to  have  excess  returns  M.  But  after  a  wait  of  T,  the 
price  has  gone  up  by  fiT,  so  that  the  remaining  mispricing  is  only  M  —  yT.  The  total  dollar  profit 
B  is  the  realized  excess  return  M  —  yT  minus  the  price  concession  Ap,  times  the  dollar  volume 
transacted  V.  Given  (41)  and  (40) 

Ap  =  arg  max  V  [M  -  a — — Ap  |  .  (42) 

Ap       \  afsAp  ) 

This  give  Ap  ~  V^l"^,  the  square  root  price  impact  (35).  We  present  this  formally: 

Proposition  8  In  the  above  setting,  7  =  1/2.   Traders  incur  the  square  root  law  of  price  impact: 

Ap~yi/2^  (43) 

wait  for  an  mean  amount  of  time 

T~  1/1/2 


""An  interpretation  is  that  the  active  trader  A  has,  perhaps  via  his  broker,  calls  successively  potential  buyers.  After 
time  t,  the  number  of  potential  liquidity  providers  are  proportional  to  t. 

-'Here  we  omit  integer  constraints.  Taking  them  into  account  would  complicate  but  not  change  the  analysis. 

--To  increase  descriptive  realism,  we  could  assume  that  the  Poisson  arrival  rate  /  itself  increases  with  the  price 
offered  |Ap|,  as  in  f  {Ap) .  It  is  easy  to  verify  that  our  results  for  the  power  laws  will  not  change  if  the  function 
/  increases,  for  large  |Ap|,  like  less  than  a  power  law  (e.g.  the  arrival  rate  tends  to  saturation  maximal  value  as 
|Ap|  — »  00,  or  say  /  (Ap)  <  aln  (6  +  |Ap|)     for  some  finite  a,  6,  k). 
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and  trade  with  a  mean  number  of  liquidity  providers 


-=(S) 


[t\ 


V 


The  full  expressions  are: 

1/2 

(44) 

Proof.    The  loss  function  (39)  gives  (44)  while  (45)  comes  from  (40).    As  liquidity  providers 

arrive  with  frequency  /,  we  have  £■  [n  |  V]  =  fE  IT  \V\  .   m 

So  large  traders  create  large  price  impacts,  and  have  to  accept  a  delay  in  the  execution  of  their 
trades.  This  is  qualitatively  consistent  with  the  stylized  facts  found  in  the  empirical  literature  : 
the  price  impact  |Ap|  is  an  increase  and  concave  function  of  the  trade  size  V  (Hasbrouck  1991, 
Holthausen  et  al.  2000,  who  find  that  a  large  part  of  the  impact  is  permanent),  impatient  traders 
(high  cost  of  time  ^)  will  have  a  bigger  price  impact  Ap  (Chan  and  Lakonishok  1993,  1995,  Keim  and 
Madhavan  1996,  1997,  Lo,  MacKinlay  and  Zhou  1997,  Breen,  Hodrick,  Korajczyk  2002).  Our  theory 
gives  a  precise  quantitative  hypothesis  for  this  price  impact  and  delay,  Ap  ~  T/i/^  and  T  ~  V^^'^. 
Future  research  might  examine  directly  whether  those  square  root  predictions  (for  large  trades)  hold 
in  the  data. 

4.4     The  distribution  of  individual  trades  q  vs  distribution  of  target  vol- 
umes Q 

Theorem  3  gives  reasons  for  distribution  of  target  volumes  with  C,v  —  3/2.  The  "upstairs"  mechanism 
above  provides  a  way  to  divide  them  into  a  several  smaller  trades  of  size  qi .  We  describe  the  resulting 
distribution  below^* . 

Proposition  9  In  the  search  process  above,  if  distribution  of  target  volumes  has  power  law  exponent 
(q  >  1,  and  the  distribution  of  the  sizes  of  liquidity  providers  has  power  law  exponent  (^^  =  1,  then 
the  distribution  of  individual  trades  has  power  law  exponent 

C,  =  Co- 

Proof.  We  will  actually  derive  the  Proposition  in  a  more  general  form,  as  here  we  assume  the 
following  supply  function  for  the  number  of  shares  supplies  by  a  liquidity  provider  of  size  Sj  to  a 
large  trader  who  wants  to  buy  a  volume  V: 

qi  =  aSiAp  =  a'siV^ 

for  a  7  G  (0, 1),  not  necessarily  7  =  1/2.  Individual  trades  of  size  x  come  from  large  trades  of  size 
V  (>  x).  There  is  a  density  p{V)  ~  l/^^i'+i)  of  them.  Each  large  volume  V  generates  a  number 


^^More  technical,  rigorous  results  are  given  in  Gabaix,  Gopikrishnan,  Plerou  and  Stanley  (2003b). 
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n  {V)  =  V/E  [q]  ~  V^-''  of  trades.  So  we  find: 

P(q^>x)r^   f       p{V)niV)P{s^V''  >x)dV 

Jv>x 


As  the  liquidity  providers'  size  distribution  is  power  law  with  exponent  (^^  =  1, 


P  {siV^  >x)=P{si>  x/V)  ~  {x/vy'^ 


and 


SO  we  have  C,q  =  Qv-   ■ 

A  very  similar  proof  would  show  that  C?  =  Cv  +  {C,s  ~  1)  (1  ~  7)  for  Cs  7^  1- 

4.5     An  optimizing  intertemporal  model  that  joins  power  laws  and  mi- 
crostructure 

We  now  join  our  model  of  intertemporal  trading  with  the  model  of  Section  3.4,  and  our  microstructure 
model  of  Section  4.3.  It  turns  out  that  a  key  ingredient  is  price  leakages.  In  their  absence,  traders 
would  trade  very  slowly,  creating  very  little  price  impact.  Leakage  is  an  important  concern  in  trading 
large  blocks.  Keim  and  Madhavan  (1996)  shows  that  in  their  sample  leakage  more  than  doubles  the 
price  impact  of  large  trades^* . 

We  assume  that,  during  the  search  process  for  the  block  trade  by  the  large  trader  A,  the  price 
increases  at  a  rate  ii  +  v,  where  v  represents  the  rate  of  leakage.  There  is  leakage  when  an  investor 
B,  after  hearing  that  A  is  going  to  buy,  try  to  "front  run"  A  and  buy  before  him.  Given  that  leakage 
affects  the  execution  price,  it  should  be  included  in  the  transaction  costs,  i.e.  count  in  the  constraint 
(iii)  of  Theorem  3. 

For  the  ideas  above,  we  propose  the  following  simplified  formalization.  Suppose  that  with  a 
probability  tt  liquidity  provider  i  will  front  run  A,  i.e.  trade  before  him.  Liquidity  provider  i  will 
have  a  mean  price  impact  /  (s,).  Hence  the  price  pressure  associated  with  trading  the  block  is  an 
increase  of  the  price  per  unit  of  time: 

i/  =  7r/(si)/, 

where  /  is  the  number  of  liquidity  providers  contacted  per  unit  of  time.  The  fact  that  i/  >  0  is 
important,  but  the  exact  expression  of  v  does  not  very  important. 

We  adopt  the  same  structure  of  beliefs  and  objective  function  as  in  Section  3.4.  In  the  trader's 
opinion,  the  dollar  profit  from  a  trade  of  size  V  is:  V  (M  —  (/i  +  !^)Ap).  So  if  he  has  true  intuitions 
about  the  market,  his  the  excess  returns  are: 

r+  =  F  ■  y  (M  -  (a*  +  i/)T  -  Ap)  IS 

If  his  intuitions  about  the  market  are  false,  his  excess  returns  are: 


-''Without  leakage  the  price  impact  if  4.3%,  while  with  leakage  it  is  10.2%. 
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r-  =-F-V  {vT  +  Ap)  IS 

Hence,  the  price  impact  includes  not  only  At,  but  also  the  price  increment  due  to  leakage  i/T. 
The  trader  maximizes  W  in  (24)  subject  to  F  <  F"^^^  and  (39),  which  we  rewrite  as: 

Ap  =  a^  (47) 

The  solution  is  given  by  the  next  Proposition. 

Proposition  10  In  the  market  setup  above,  we  assume  that  the  trader  maximizes  the  objective 
function  W  given  in  (24).  In  the  regime  where  F  ^  1  (large  aversion  to  underperformance)  and 
M  ^  S^^^fi  (which  can  be  more  concretely  expressed  M  ^  fj,T  for  the  optimal  solution),  the  solution 
of  problem  above  yields  V  '^  S^  with  d  =  2/3.  Thus,  the  conditions  of  Theorem  3  hold  with  5  =  2/3 
and  the  square  root  price  impact  7  =  1/2. 

Proof.  It  is  easy  to  see  that  the  constraints  bind.  Consider  the  limit  /it  =  0.  The  problem 
becomes: 

maxF^^^yM-rS 

Ap,V  ~ 

s.t.  F"^^  ('^"X"  +  ^P)^  =  ^^  (48) 

Optimizing  over  Ap,  we  get  Ap  =  {i/aVf^'^,  and  (48)  gives  2V^^'^F'^^  {ua)^^"^  =  rS,  hence  V  ~ 
52/3 

The  proof  is  now  complete.  In  addition,  it  may  be  instructive  to  see  which  variables  matter  for  the 
limit  At  ->  0.  For  this  define  x  =  (uaV)^^'^  /Ap,  B  =  rF""^"  "^  (^/a)"^/^  77  =  B^^^M-'^S^/^  {alvf^  /x, 
and  y  =  BSV~^^^ .  The  problem  becomes: 


maxy 


'    —  T\xy     s.t.  X  -\-  X      =  y 


This  forces  y  >  2,  and  leads  to  a;  =  ^(y)  =  [y  —  (j/^  —  4)       J  /2,  and  2/(77)  =  arg  maxy  ^^^  — 

T](,  {y)  y~^.We  have  y  (0)  =  2,  and  the  solution  is  continuous  in  77.  Thus  we  get  y  (77)  =  2  +  o  (77),  i.e. 
y  =  52/3(l  +  o(Mri/35i/3M-i)).    ■ 

Hence  our  predictions  are  confirmed  by  this  framework.  The  cubic  law  arises  if  large  traders  are 
forced  to  trade  quickly  because  of  price  leakage. 

4.6     Some  comments  on  the  model 

4.6.1     Arbitrage 

Ova  model  is  does  not  leave  scope  for  short-term  arbitrage,  as  the  trading  decisions  are  unpredictable. 
However,  it  offers  some  scope  for  predictability  of  long  term  returns.  Indeed  as  volume  moves  prices 
without  necessarily  informational  reasons,  prices  deviate  from  fundamentals,  and  they  will  revert  in 
the  long  term.  There  is  a  evidence  for  this  phenomenon,  see  e.g.  Campbell  and  Shiller  (1988).  So 
our  model  requires  limited  arbitrage  of  long  term  returns.  There  are  good  reasons  why  long  term 
arbitrage  is  difficult,  see  e.g.  Shleifer  and  Vishny  (1997)  and  Shleifer  (2000). 
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4.6.2      Plausibility  of  very  large  volumes  and  price  impact 

The  literature  finds  a  big  impact  of  big  traders.  Chan  and  Lakonishok  (1993,  1995)  find  a  range 
of  30-100  bps,  Keim  and  Madhavan  (1996),  looking  at  smaller  stocks,  find  400  bps.  There  are  also 
many  colorful  examples  of  large  investors  moving  the  market.  Some  are  reported  by  Corsetti,  Pesenti 
and  Roubini  (2002),  Coyne  and  Witter  (2002)  and  the  Report  on  the  1987  crash  (United  States' 
Presidential  Task  Force  on  Market  Mechanisms  1988). 

As  it  is  difficult  to  detect  exogenous  trades,  measuring  a  price  impact  is  difficult.  There  is  a  more 
"intuitive"  way  to  convince  oneself  that  a  large  fund  is  likely  to  move  prices.  The  typical  yearly 
turnover  of  a  stock  is  50%  (Lo  and  Wang  2001),  so  that  with  250  trading  days  per  year,  its  daily 
turnover  is  0.5/250=0.2%  of  the  shares  outstanding.  Consider  a  moderate  size  fund,  e.g.  the  30th 
largest  fund.  At  the  end  of  2000,  such  a  fund  held  0.1%  of  market^^.  So,  on  the  average,  it  will  hold 
0.1%  of  the  capitalization  of  a  given  stock.  So  if  the  fund  wants  to  sell  its  holdings,  it  will  create 
an  additional  0.1%  in  turnover  of  stock,  while  the  regular  turnover  is  0.2%  per  day,  i.e.,  the  fund 
will  have  to  absorb  half  of  the  daily  turnover.  So,  the  size  of  the  desired  trade  of  this  fund  is  quite 
important  compared  to  the  normal  turnover.  This  supports  the  idea  that  large  funds  are  indeed 
large  for  liquidity  of  the  market,  and  supports  the  assumption  that  big  traders  will  pay  attention  to 
their  trading  strategy  to  moderate  their  price  impact. 

5     Assessing  some  further  empirical  predictions  of  the  model 

5.1     Contemporaneous  behavior  of  several  mesisures  of  trading  activity 

5.1.1      Definitions 

The  above  model  proposes  an  explanation  for  the  power  laws  of  financial  market  activity.  We 
now  show  how  it  correctly  explains  another  series  of  patterns  in  trading  activity,  pertaining  to  the 
joint  distributions  of  returns,  volumes,  and  their  variants.  More  precisely,  we  fix  a  time  interval 
At  =  15min.  We  will  consider  the  return  rj,  the  volume^®  Qt  and  number  of  trades  Nt  in  interval 
{t  —  At,  t] .  We  also  partition  those  quantities  in  buys  and  sells.  Suppose  each  trade  i  occurring  at 
time  z,  with  a  quantity  of  shares  exchanged  qi,  and  define  e^  =  +1  is  the  trade  can  be  identified  as 
buy  initiated,  Ej  =  —  1  if  it  was  sell-initiated,  Ej  =  0  if  no  identification  is  possible^^.  We  define  N^ 
as  the  net  number  of  trades,  i.e.  number  of  "buy"  trades  minus  the  number  of  "sell"  trades.  Finally 
we  define  Q'f  as  the  net  volume,  i.e.  number  of  shares  exchanged  that  come  from  a  buy  order  minus 


■^^It  had  $19  billion  in  assets  under  management.  The  total  market  capitalization  of  The  New  York  Stock  Exchange, 
the  NASDAQ  and  the  American  Stock  Exchange  was  $18  trillion. 

^^The  dollar  value  of  the  shares  traded,  or  the  number  of  the  shares  traded  divided  by  the  number  of  shares 
outstanding  give  the  same  results,  as  they  are  proportional  in  the  short  term. 

■^'We  us  the  Lee-Ready  (1991)  algorithm.  We  identify  buyer  and  seller  initiated  trades  using  the  bid  and  ask 
quotes  PbW  and  PaW  at  which  a  market  maker  is  willing  to  buy  or  sell  respectively.  Using  the  mid- value  PmC*)  = 
(Pa{')  +  PbW)/'^  of  the  prevailing  quote,  we  label  a  transaction  buyer  initiated  if  P{t)  >  PmC'),  f"i<l  seller  initiated 
if  P(f)  <  PmC*)-  For  transactions  occurring  exactly  at  PmC*),  we  use  the  sign  of  the  change  in  price  from  the  previous 
trade  to  determine  whether  the  trade  is  buyer  or  seller  initiated,  while  if  the  previous  transaction  is  at  the  current 
trade  price,  the  trade  is  labelled  indeterminate.  We  use  the  prevailing  quote  at  least  5  s  prior  to  the  trade.  17%  of 
the  trades  are  left  indeterminate. 
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We  compute  the  graphs  £  [y  |  X]  for  many  combinations  oi  X,Y  =  r,  Q,  Q',  N,  N',  N'/N, 
Q/N .  The  choice  was  guided  by  an  assessment  of  how  informative  such  a  graph  would  be,  and  how 
non-trivial  the  shape  predicted  by  the  theory  would  be.  The  model  reproduces  all  the  shapes  of 
those  graphs,  as  can  be  seen  in  Figure  8  below. 

We  use  simulations  to  plot  the  model's  predictions.  We  compare  the  empirical  and  theoretical 
conditional  expectations  E\Y  \X],  for  X,Y  =  Q, TV, Q' , N' , r, N'/N, r^, Q/N.  We  show  our  results 
only  for  selected  pairs  of  X,  Y  that  seem  nontrivial. 

5.1.2      Comparison  between  theory  and  facts 

In  Figure  8a-e  we  present  the  empirical  values  on  the  left  panel,  and  the  model's  predictions  on 
the  right  panel.  We  obtained  the  model's  predictions  using  Monte  Carlo  simulations  detailed  in 
Appendix  E.  The  graphs  show  that  the  model  broadly  matches^^  all  the  S  [y  |  X]  that  we  construct 
empirically.  Consider  for  example  E[r  |  Q']  in  Figure  8a.  Our  model  reproduces  on  the  left  panel  the 
increasing,  concave  shape  (in  the  positive  region).  The  aggregation  over  several  trades  flattens  the 
shape  of  the  price  impact  function.  Plotting  the  symmetrical  version,  E  [Q'  \  r],  yields  a  surprising 
finding.  The  empirical  graph,  in  Figure  8b  is  now  Unear.  The  model  matches  this  in  the  right  panel 
of  Figure  8b.  The  cause  of  the  linearity,  again,  is  the  aggregation  over  several  trades. 

The  model  predicts  that  periods  of  large  volume  are  periods  when  large  traders  execute  trades 
in  smaller  pieces,  so  that,  for  a  large  buy  trade,  one  observes  large  number  of  buy  smaller  trades.  A 
compact  way  to  say  that  is  that  A'^  is  high,  and  |A'''/A'^|  is  high  too  when  the  volume  is  high^^.  This 
is  expressed  in  Figure  8c. 

It  is  an  important  feature  of  the  model  that  large  trades  beget  more  trades.  Indeed,  in  our  model 

I  AT' I  ~  AT  (53) 

for  large  N  and  is  dominated  by  one  large  fund  manager  who  desires  to  trade  a  volume  Vj,  and 
creates  a  number  of  orders  V, '  ,  so  that  rij,  N,  and  \N'\  have  the  same  order  of  magnitude,  V, '  . 
Relation  (53)  means  that  most  trades  have  the  same  sign,  i.e.,  move  the  price  in  the  same  direction. 


-* Given  we  have  1.5  million  observations,  it  is  likely  that  statistical  tests  would  show  that  the  fit  is  not  perfect. 
Our  intention  here  is  just  to  illustrate  that  the  fit  is  good.  To  obtain  a  perfect  fit,  one  would  need  to  complicate  the 
model,  in  particular  to  introduce  autocorrelation  in  the  propensity  to  trade.  Here  we  chose  to  show  that  the  simple, 
non-optimized,  i.i.d.  version  of  our  model  can  get  a  pretty  good  fit. 

^^A  very  different  relationship  would  happen  under  another  plausible  model.  Suppose  that  there  are  N  trades 
with  i.i.d.  signs  (£i);_j  jy  and  sizes  (Vi);_j  ;^,  so  that  Q'  =  J^iLi^i^i-  Then  a  large  N  corresponds  to  an  N'  = 
Yli=\  6i  =  O  (  v^),  so  that  N'/N  =  O  (i/Vn)  is  small.  Likewise  a  large  Q'  corresponds  in  part  to  a  large  N,  and 
a  small  \N' /N\,  counterfactually. 
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the  sign  of  the  trade  of  the  large  fund  manager.  Equation  (53)  is  indeed  consistent  with  empirical 
data  shown  in  Figure  8d.  This  contrasts  with  a  simple  alternative  model  where  each  desire  to  trade 
would  create  only  one  trade,  as  in  a  competitive  market.  In  this  alternative  model  we  would  have 
N'  =  Elliot'  where  £i  =  ±1,  leading  to  |7V'|  ~  A^i/^  in  the  tail  events,  or  E[N  \  N']  ~  N'^ 
in  contrast  to  the  data  in  Figure  8d.  Figure  8e  supports  the  view  that  in  periods  of  high  volume 
imbalance  V ,  most  trades  change  the  price  in  the  same  direction.  Indeed,  the  data  and  the  model 
exhibit  a  similar  sharp  transition  of  N'  /N  as  V  changes  sign. 

We  tested  other  cases  too  and  we  find  good  agreement  between  the  theory  and  empirical  data^° . 
Here,  we  report  the  curves  with  the  shapes  we  deem  most  informative.  Given  that  the  model  was 
not  constructed  to  match  those  facts,  we  view  this  as  a  very  reassuring  feature  of  the  model. 

5.2     Some  untested  predictions 

The  model  makes  a  number  of  untested  predictions,  which  we  present  in  this  Section. 

•  As  per  equation  (21),  a  large  fund  of  size  S  will  do  "price-moving"  block  trades'^^  of  size  F  ~  S'' 
with  a  frequency  that  depends  on  size  as  S/V^^^ ,  where  7  is  the  exponent  of  the  price  impact 
(14). 

•  Expression  (26)  for  the  volume  traded  in  an  individual  stock,  in  particular  the  dependence  in 
S  and  the  prefactor  of  the  price  impact  hi  (23) 

•  The  expressions  (44)- (46)  for  the  price  impact,  the  time  to  execution,  and  the  number  of 
trades.  In  particular,  it  would  be  interesting  to  test  their  "square  root"  dependence  in  5,  and 
the  differential  impact  of  ^  (the  impatience  of  the  trader),  a,  /,  s. 

•  The  linear  supply  function  (38),  saying  that  a  trader  willing  to  wait  an  amount  of  time  T  and 
to  pay  a  price  increment  Ap,  will  get  an  amount  of  shares  S  (Ap,  T)  that  will  be  proportional 
to  Ap  •  T  in  the  limit  of  large  Ap  and  T. 

•  We  anticipate  the  values  of  the  four  trading  exponents  a,  ^,  n  and  t  in  Appendix  B  to  be  0, 
1,  0  and  1  respectively. 

Hence  our  theory  makes  predictions  about  the  value  of  24  power  law  exponents^^ .  We  can  make 
also  graphical  predictions  £■  [y  |  X]  graphs  that  we  did  not  build^^,  which  yields  56  predictions.  We 
leave  the  investigation  of  those  values  to  future  research. 

6     Related  literature 

6.1     Alternative  theories 

The  general  theme  is  that  most  theories  (certainly  those  that  we  are  aware  of)  do  not  explain  (and 
cannot  explain  with  a  simple  modification)  the  cubic  and  half-cubic  laws.  At  best,  they  have  free 
parameters  which  can  be  artificially  tuned  to  replicated  the  ^^  =  3. 


^^The  graphs  we  tested  where  E[V  \  r],  E[N  \  r],  £  [r  |  V],  E[V'  |  N'],  E[N'  |  V],  E[V/N  |  r],  and  empirical 
finding  and  theory  matched  very  well. 

^'So  looking  at  the  fund  turnover  will  not  be  the  right  thing  to  do.  To  provide  liquidity,  or  avoid  looking  like 
a  "closet  indexer"  a  large  fund  may  maintain  a  fairly  large  turnover,  though  the  "price  moving"  trades  will  indeed 
happen  less  often,  as  they  move  prices  a  lot. 

'-The  above  bullets  make  respectively  2,  2,  5  •  3  =  15,  1  and  4  predictions. 

''The  printout  of  the  56  theoretical  relationships  E[Y  \  X]  involving  V,N,V',N',r,N'/N,r^,V/N  \s  available  from 
the  authors  upon  request. 
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Figure  8:  Conditional  expectations  for  (a)  E\t  \  Q'],  (b)  E\Q'  \  r],  (c)  E{N  \  Q%  (d)  E\N  \  N% 
and  (e)  E[N' /N  \  Q'].  We  form,  for  each  interval  Ai  =  15  min,  the  quantities  (i)  r,  the  returns,  (ii) 
Qb  (resp.  Qs),  the  number  of  shares  exchanged  in  a  buyer  (resp.  seller)  initiated  trade,  (iii)  Nb 
(resp.  Ns),  the  number  of  buyer  (resp.  seller)  initiated  trades,  Q'  =  Qb  —  Qs-,  and  A'''  =  Nb  —  Ns- 
The  left  panel  shows  the  empirical  values  for  the  116  frequently  traded  stocks  in  the  Trades  and 
Quotes  database  for  1994-1995.  This  represents  1.5  million  observations.  Variables  are  normalized 
to  unit  variance  after  setting  the  mean  to  zero;  for  variables  such  as  volume  for  which  the  variance 
is  divergent,  we  have  by  normalized  the  first  moment  instead.  The  right  panel  shows  the  model's 
predictions,  which  agree  well  with  the  empirical  d^ta. 


6.1.1  The  public  news  based  (efficient  meirkets)  model 

In  this  model,  price  movements  reflect  without  frictions  public  pieces  of  news.  One  would  need  to 
make  large  number  of  assumptions  to  make  this  model  fit  the  empirical  power  laws.  First,  because 
returns  reflects  news,  we  have  to  assume  that  the  distribution  of  news  has  ^j.  =  3. 

This  is  not  the  only  difficulty.  In  the  benchmark  where  volumes  do  not  move  prices,  there  is  no 
reason  for  big  traders  to  trade  less  than  small  ones.  With  no  price  impact,  essentially  all  models  that 
traders  should  trade  in  amount  proportional  to  their  sizes,  so  that  the  volume  Vi  traded  by  trader  i 
will  be  equal  to  Vi  —  aiSi  for  a^  a  random  variable  that  do  not  scale  with  size.  Then,  we  would  get 
Cv  =  Cs  =  IThis  prediction  is  a  major  difficulty  with  the  news  approach.  Otherwise,  to  work  with 
the  public  news  model,  one  would  have  to  assume  ^v  =  3/2 ,  for  some  inexplicable  reason.  Hence 
the  values  of  i^^  and  of  ^q  are  just  assumed  in  that  model  and  do  not  arise  as  a  consequence  of  a 
theory. 

6.1.2  A  mechanical  "price  reaction  to  trades"  model 

In  this  model,  the  desired  N  trades  are  i.i.d.  The  price  has  a  reaction  /  {ejQj)  to  each  trade  of  size 
Qj  and  sign  £j.  So  r  =  X)i=i  /  i^jlj)  and  the  other  variables  are  defined  as  in  (49  )-(52). 

This  model  is  a  variant  of  our  model,  and  so  it  will  tend  to  generate  qualitatively  right  £?  [F  |  X] 
curves.  However  there  are  problems  with  this  model.  Firstly,  we  need  to  assume  Cq  =  3/2.  However, 
given  (^5  =  1,  the  natural  prediction  would  be  i^q  =  1.  Secondly,  we  need  to  assume  (^yv  —  3.  It 
could  be  any  other  value.  Thirdly,  we  also  need  to  assume  a  price  impact  Ap  «  yi/2  to  get  the 
shape  oi  E[r^  \  Q]. 

6.1.3  Random  bilatercJ  matching 

This  has  been  proposed  by  Solomon  and  Richmond  (2001).  The  model  says  that  two  traders  of 
size  5,  S'  meet,  and  the  resulting  volume  is  max(5, 5')  and  the  price  impact  is  min(5, 5').  While 
this  approach  is  reasonable,  it  is  not  without  problems.  As  the  initial  size  distribution  is  (^s  =  1) 
Cmin(s,S')  =  2,  SO  that  one  would  predict  Cq  —  Cr  =  2.  The  authors  rely  on  a  scaling  exponent  of  the 
size  of  agent  ^5  =  3/2,  as  is  approximately  the  case  in  the  US  and  the  UK  in  the  late  1990s,  but 
not  in  other  periods.  A  major  difficulty  for  this  theory  is  that  empirically  the  exponent  is  individual 
wealth  is  highly  variable.  Feenberg  and  Poterba  (1993)  show  that  the  exponent  for  the  wealth  of 
U.S.  individuals  had  large  variations  from  1.5  to  2.5  in  the  last  30  years.  This  large  variability  is 
confirmed  in  a  series  of  papers  on  U.S.  and  international  data  by  Piketty  and  Saez  (e.g.  Piketty  and 
Saez  2002). 

6.2     Related  empirical  findings 

6.2.1      Other  models  for  returns 

The  empirical  literature  has  proposed  other  distributions.  We  are  more  confident  about  our  findings 
rely  on  much  larger  number  of  data  points,  hence  quantify  the  tails  more  reliably.  We  can  also  explain 
previous  findings  in  light  of  ours.  Andersen  et  al.  (2001)  show  that  the  bulk  of  the  distribution  of 
realized  volatility  is  lognormal.  In  independent  work  Liu  et  al.  (1999)  show  that  while  this  is  true, 
the  tails  are  power  law.  Gopikrishnan  et  al.  (1999)  also  report  how,  with  a  small  sample  that  looks 
only  at  the  center  rather  than  the  tails  of  the  distributions,  one  would  find  L6vy  or  truncated  Levy 
distributions,  as  was  found  by  Mantegna  and  Stanley  (1995). 

Finally  Ane  and  Geman  (2000)  propose  a  model  which  is  in  essence  rt  =  Ylij=\  '"■i  where  Ui  are 
normal  shocks.  This  is  a  variant  of  the  Clark  (1973)  model  where  the  clock  is  driven  by  the  number 
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of  trades  rather  than  the  volume.  This  model  has  an  important  empirical  problem:  from  Cw  >  3,  as 
\r\  ~  N^^'^  in  this  model,  we  conclude  Cr  =  2^^  >  6,  so  that  model  fails  to  reproduce  the  empirical 
cubic  law  of  returns.  In  fact,  in  our  model,  results  similar  to  Ane-Geman  would  be  found:  for 
instance,  Sfr^  |  N]  looks  almost  linear  (though  it  is  not  really  linear).  Under  the  null  of  our  model, 
their  model  is  a  good,  but  ultimately  inacciu:ate,  representation  of  reality. 

6.2.2      Buy  /  Sell  asymmetry 

Our  basic  model  has  built-in  full  symmetry  between  positive  and  negative  tails.  This  is  not  a  bad 
approximation  of  reality:  the  exponents  are  roughly  identical,  around  3.  However,  it  is  true  that 
the  exponents  are  slightly  lower  (fatter  tails)  in  the  negative  tail  that  in  the  positive  one.  More 
generally,  one  finds  a  bit  more  skewness  in  the  negative  tail.  One  reason  may  found  in  the  "leverage 
effects".  Volatility  goes  up  after  negative  returns.  Mechanically,  this  skews  to  the  left  the  returns 
over  a  non-infinitesimal  At,  and  the  effect  grows  with  At. 

Our  framework  suggests  another  hypothesis.  Suppose  the  frequency  of  arrival  /  of  liquidity 
providers  is  higher  for  desired  sells  (so  the  liquidity  providers  have  to  buy  the  asset)  than  buys. 
Chan  and  Lakonishok  (1993)  report  the  "Street"  explanation  for  this^''.  When  one  looks  to  buy  a 
large  amount,  one  can  only  deal  with  liquidity  providers  who  already  own  the  assets,  if  shorting  is 
difficult.  But  all  liquidity  provider  can  absorb  a  sell  order.  So  it  is  easier  to  meet  counterparts  for 
sells  than  for  buys,  i.e.  /+  <  /- ,  where  /+  (resp.  /_ )  is  frequency  of  arrival  of  liquidity  providers 
that  are  willing  to  be  counterpart  of  a  buy  (resp.  sell)  order,  /_ .  By  plugging  /+  or  /_  in  (70) 
we  see  that  [Ap+I  >  |Ap_|,  i.e.  the  theory  predicts  higher  price  movements  for  buys  than  for  sells, 
though  also  identical  exponents. 

6.3  Link  with  the  behaviorcJ  and  excess  volatility  literature 

There  is  a  growing  literature  suggesting  that  many  movements  in  the  markets  are  hard  to  explain  by 
any  kinds  of  news.  Cutler,  Poterba,  Summers  (1989)  look  at  daily  returns.  Fair  (2002)  looks  at  the 
largest  1  to  5  min  S&P500  returns.  Both  studies  find  that  many  are  not  explainable  by  news.  The 
puzzle  is  stated  in  related  ways  by  French  and  Roll  (1986),  Roll  (1988),  and  Shiller  (1989).  Trading 
per  se  seems  to  create  volatility  (Evans  and  Lyons  2002),  and  returns,  even  ex  post,  are  hard  to 
explain.  Our  theory  would  predict  that,  on  those  events  of  large  movements  with  no  news,  trading 
volume  is  abnormally  high.  It  would  also  predict  that  those  returns  should  mean  revert. 

6.4  Link  with  the  microstructure  literature 

The  literature  from  which  this  paper  draws  most  is  perhaps  the  microstructure  literature.  This 
literature  is  extremely  vast  (for  reviews  see  e.g.  O'Hara  1997  and  Biais  et  al.  2002),  so  we  will  only 
mention  the  most  directly  relevant  pointers.  Our  theory  makes  predictions  above  the  relationships 
between  volume  and  price  impact,  and  we  reviewed  in  Section  4  the  papers  directly  pertaining  to 
our  theory.  Other  relevant  papers,  that  show  a  long  run  impact  of  trading  on  prices  include  Evans 
and  Lyons  (2002),  and  the  analysis  of  volume  in  Wang  (1994),  Lo  and  Wang  (2001).  Rather  than 
the  fine  analysis  proposed  in  the  literatm-e  (e.g.  Madhavan  and  Cheng  1997),  we  adopted  a  fairly 
stylized  view  of  trading  institutions.  This  was  motivated  by  the  fact  that  the  cubic  exponents  arise 
for  different  market  structmes,  so  that  our  explanation  should  not  depend  too  finely  on  a  specific 
market  structure. 

Our  goal  is  to  explain  the  cubic  power  laws,  and  the  patterns  in  trading  activity  at  a  fairly 
high  degree  of  aggregation  (1/2  hour  as  opposed  to  trade  by  trade).    Hence  we  highlight  some 


Saar  (2001)  provides  an  alternative  explanation  based  on  asymmetric  information. 
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traits  of  reality  -  mainly  the  heterogeneity  between  the  size  of  agents,  and  the  trade-off  between 
execution  cost  and  execution  time  -  that  are  typically  not  central  to  the  microstructure  literature. 
Another  feature  is  that  we  are  agnostic  about  the  importance  of  information  asymmetry  in  the 
determination  of  asset  prices.  We  highlight  more  "macro"  phenomena  that  arise  from  aggregation, 
rather  the  detailed  understanding  of  micro  situations  that  tend  to  be  the  focus  of  the  microstructure 
literature.  Hence,  our  model  and  the  models  of  this  literature  appear  quite  different.  Still,  it  would 
be  desirable,  in  further  research,  to  merge  the  rich  informational  and  institutional  understanding 
from  the  microstructure  literature  and  the  more  macro  approach  of  the  present  paper. 

6.5     Link  with  the  "econophysics"  literature 

This  paper  is  part  of  a  broader  movement  utilizing  tools  from  physics  in  for  the  study  of  economic 
issues.  In  the  same  way  that  the  application  of  physics  to  earth  sciences  and  biology  are  called 
geophysics  and  biophysics,  this  literature  is  sometimes  called  "econophysics".  This  literature  was 
pioneered  by  Mandelbrot  (1963).  This  literature  is  growing,  and  includes  Bak,  Chen,  Scheinkman 
and  Woodford  (1993),  Bouchaud  and  Potters  (2000),  Canning  et  al.  (1998),  Gabaix  (1999,  2003), 
Plerou  et  al.  (1999),  Lux  and  Sornette  (2002),  Levy,  Levy  and  Solomon  (2000),  Mantegna  and 
Stanley  (2000). 

7     Conclusion 

This  paper  proposed  a  theory  of  power  law  behavior  of  several  variables  that  describe  financial 
market  activity.  It  explains  the  cubic  and  half-cubic  exponent  of  returns  and  volume,  and  also 
makes  many  predictions  about  other  quantities.  Section  5.2  presents  numerical  predictions  about  24 
other  power  law  exponents.  This  offers  ample  room  for  future  research.  We  have  examined  some  of 
its  predictions,  in  particular  in  Figure  8.  Further  research^^  is  made  is  by  the  modular  structure  of 
our  arguments,  as  shown  in  by  Figure  6. 

This  theory  can  also  be  used  for  policy  analysis.  The  cubic  law  predicts  the  existence  of  very 
large  returns,  and  Gabaix  et  al.  (2003c)  report  that  stock  market  crashes  such  as  the  1929  or  1987 
crashes  do  do  not  seem  to  be  outliers  to  the  cubic  law.  Hence  if  we  understand  the  origin  of  the 
cubic  law,  we  may  understand  the  origin  of  stock  market  crashes.  Within  the  present  framework, 
Gabaix  et  al.  (2003d)  examine  the  performance  of  traditional  proposals  against  crashes,  such  as 
the  Tobin  tax  or  circuit  breakers.  It  shows  that  they  not  dampen  crashes,  and  works  out  different 
proposals  that  in  principle  would  achieve  that  goal. 

On  the  substantive  front,  our  model  provides  one  possible  quantitative  theory  of  excess  volatility 
in  asset  markets:  it  is  simply  due  to  the  desire  to  trade  of  large  traders  (perhaps  stimulated  by  news). 
More  precisely,  it  is  the  power  law  1  of  the  distribution  of  sizes  that  generates,  through  an  intelligent 
though  not  hyperrational  (people  trade  too  much)  trading  process,  the  power  laws  of  3  in  returns 
and  3/2  in  volume.  On  the  methodological  front,  we  introduce  some  new  questions  that  finance 
theories  should  answer.  Matching,  as  we  do,  the  quantitative  empirical  regularities  established  here 
(in  particular  explaining  the  exponents  of  3  and  3/2,  not  merely  assuming  them),  in  particular  the 
cubic  laws,  would  be  a  sine  qua  non  criterion  for  the  admissibility  of  a  of  volume  and  volatility.  We 
hope  that  the  regularities  we  established  will  sharply  constrain,  and  guide,  future  theories.   Given 


'^Our  model  has  very  little  to  say  about  the  low  frequency  structure  of  trading.  Our  model  contains  a  series  of 
independent  trading  decisions,  and  is  silent  about  the  time-correlations  in  market  activity.  It  is  straightforward  to 
propose  simple  extensions  of  it,  such  as  an  GARCH-type  model  for  the  propensity  to  trade  Jt,  so  as  to  qualitatively 
account  for  the  well-known  long  term  memory  in  volatility  (where  Jt+i  depends  positively  on  past  J'^s  and  |rt|).  The 
proper  analysis  of  this,  however,  is  outside  the  scope  of  this  paper.  We  are  investigating  this  in  ongoing  research. 
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its  empirical  success  and  its  simple  structure,  the  present  model  might  a  useful  point  of  departure 
to  think  about  those  issues. 
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8     Appendix  A:  Some  power  law  mathematics 

We  present  here  some  basic  facts  about  power  law  mathematics,  and  show  how  their  great  aggre- 
gation properties  makes  them  especially  interesting  for  both  theoretical  and  empirical  work.  They 
also  show  how  our  predictions  are  robust  to  other  sources  of  noise. 

A  random  variable  X  has  power  law  behavior  if  there  is  a  ^x  >  0  such  that: 

so  that  the  probability  density  p  {x)  follows: 

1 
p{x) 


xCx+i 


A  more  general  definition  is  that  there  is  a  "slowly  varying"  ^^  function  L{x)  and  a  Cx  s.t. 
p{x)  ~  L{x)  /x''^'^^,  so  that  the  tail  follows  a  power  law  "up  to  logarithmic"  (by  some  abuse  of 
language)  corrections. 

Cx  <  Cy  means  that  X  has  fatter  tails  than  Y,  hence  the  large  X's  are  (infinitely,  at  the  limit) 
more  frequent  than  large  Y's. 

The  definition  implies  that  with  a  >  0,  E  [|^|"]  =  oo  for  q  >  (^x,  and  E  [|X|°]  <  oo  for  a  <  ^x- 
As  an  example,  if  for  returns  (^r  =  3,  then  E  [\r\°']  for  q  >  3.  In  particular,  the  kurtosis  of  returns 
in  infinite^^,  and  their  skewness  borderline  infinite.  For  Y  a  normal,  lognormal,  exponential,  the 
formal  power  law  exponent  is  i^y  =  oo,  given  Y  dies  out  faster  than  any  power  laws. 

Power  laws  have  great  aggregation  properties.  The  property  of  being  power  law  is  conserved 
under  addition,  multiplication,  polynomial  transformation,  min,  max.  The  motto  is  that,  when  we 
combine  two  power  law  variables,  "the  biggest  (fattest=smallest  exponent)  power  law  dominates" . 
Indeed,  for  X,  Y  independent  variables,  we  have  the  formulaire: 

Cx+y  =  CxY  =  Cmax(x,y)  =  min  (Cx,  Cy)  (54) 

=  power  law  of  fattest  variable 

Cmin(x,y)  =  Cx  +  Cy 

For  instance,  if  X  is  a  power  law  for  Cx  <  oo,  and  Y  is  power  law  variable  with  an  exponent 
Cy  >  Cx,  or  even  normal,  lognormal  or  exponential  variable  (so  that  Cy  =  oo),  then  X  +  Y,X  -Y, 
max(X,  y)  are  still  power  laws  with  the  same  exponent  Cx-  So  multiplying  by  normal  variables, 
adding  nan  fat  tail  noise,  summing  over  i.i.d.  variables  preserves  the  exponent.  So  (i)  this  makes 
theorizing  with  power  law  very  streamlined;  (ii)  this  lets  the  empiricist  hope  that  those  power  laws 
can  be  measured,  even  if  there  is  a  fair  amount  of  noise  in  the  data.  One  does  not  need  to  carry 
around  the  additional  noise,  because  though  it  will  affect  variances  etc.,  it  will  not  affect  the  power 


^^L{x)  is  said  to  be  slowly  varying  (e.g.  Embrechts  et  al.  1997,  p. 564)  if 

lim   L  (tx)  /L  (x)  =  1  for  all  t  >  0. 

X — ♦CX) 

Prototypical  examples  are  L  =  a  and  L  (x)  =  a  In  x  for  a  non-zero  constant  a. 

This  makes  the  use  of  the  kurtosis  invalid.    As  the  theoretical  kurtosis  is  infinite,  empirical  measures  of  it  are 
essentially  meaningless.  As  a  symptom,  according  L6vy's  theorem,  the  median  sample  kurtosis  of  T  i.i.d.  demeaned 

variables  ri,...,rT,  with  kt  —  {YlT=i^i/'^}/{'lli=i'''f/Tj  ,  increases  to  +oo  like  T^/^,  as  the  sample  size  T 
increases.  The  use  of  kurtosis  should  banished.  As  a  simple  diagnostic  for  having  "fatter  tail  than  from  normality", 
we  would  recommend,  rather  than  the  kurtosis,  quantile  measures  such  as  P  {\{r  —  (r))  /ar\  >  1.96)  /.05  —  1,  which  is 
positive  if  tails  are  fatter  than  a  normal. 
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law  exponent.  Power  law  exponents  carry  over  the  "essence"  of  the  phenomenon:  smaller  order 
effects  do  not  affect  the  power  law  exponent. 

For  instance,  say  a  theory,  for  instance  ours,  gives  a  mechanism  for  R,  with  (^/j  =  3.  Other  things 
are  going  on,  so  that  in  reedity,  we  observe: 

Ut  =  (HtRit  +  bit 

For  instance,  the  liquidity  of  the  market  varies,  so  that  aft  is  random,  and  news  can  affect  prices  bn 
without  affecting  volume.  But  even  then,  we  will  have  (^^  =  Cfi  =  3  if  a,  6  are  smaller  order  effects, 
i.e.  have  thinner  power  laws  (CaiCb  ^  3).  If  the  theory  of  Ra  capture  the  first  order  effects  (i.e. 
those  with  dominating  power  law),  its  predictions  for  the  power  law  exponents  of  the  "noised  up" 
empirical  counterpart  ru  will  still  be  true. 

Proof.  See  Breiman  (1965)  and  Gnedenko  and  Kolmogorov  (1968)  for  rigorous  proofs,  and 
Sornette  (2000)  for  intuitive,  heuristic  derivations.  Here  we  just  indicate  the  proofs  for  the  min  and 
max  operations.  For  Cmin{x,Y)  >  we  have: 

P  (min  {X,Y)  >  x)  =  P  {X  >  X  andY  >  x)  =  P  {X  >  X  )  P  [Y  >  x) 
k     k'  kk' 


x'^^  x^y       a;Cx+Cv 
Also, 

P(max(X,y)  >x)  =  l  -  P(max(X,y)  <x)  =  l-P{X<xandY<x) 
=  1-P{X  <x)P(Y  <x) 

\  xi>^  J    \  X'^y  J  a;nim(Cx,Cy) 

where  k"  =  k  ii  Cx  <  Cy,  k"  =  k'  if  Cx  >  Cv,  and  A;  =  fc  +  k'  if  Cx  =  Cy-   ■ 
This  generalizes  to  K  independent  power  law  variables: 

C.Xi+...+Xk  =  Cxj-...-Xk  =  Cmax(Xi,...,A-K)  =  min (Cxj ,  ••■,  Cxk)  (55) 

Cmin(A:i,...,XK)  =  Cx,  +  ■■•Cxk 

Finally,  a  useful  formula  is: 

Cx°  =  —  for  a  >  0  (56) 

a 

The  proof  is  the  same  as  in  Proposition  2. 

9     Appendix  B:  Generalization  of  Theorem  3  to  non  i.i.d.  set- 
tings 

We  sketch  here  how  our  main  results,  including  Theorem  3,  extend  without  difficulty  to  non  i.i.d. 
settings.  For  instance,  there  can  be  autocorrelation  in  market  liquidity  and  trade  arrival.  Also, 
different  traders  can  have  different  trading  frequencies.  We  index  large  traders  by  a.  We  say  that 
Tat  =  1  if  a  does  trade  at  time  t  in  that  interval.  Tat  =  0  otherwise.  So,  the  total  volume  and  returns 
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created  by  the  fund  are  respectively: 


Vat  =  TatkatS'-  (57) 

rat  =  ±TatKtV^P  (58) 

for  some  possibly  time- varying,  correlated  prefactors  kat,  ^at-  While  the  paper  is,  for  simplicity, 
i.i.d.,  this  formulation  allows  the  prefactors  kat,  ^at  and  other  prefactors  to  be  correlated  between 
stocks  and  across  time.  For  instance,  a  period  with  high  kat  and  low  Xat  has  large  typical  volume 
and  low  price  impact.  This  captures  the  presence  of  time- varying  liquidity  and  volume  that  have 
been  detected  in  empirical  data  (see  e.g.  Lobato  and  Velasco  2000). 
The  aggregate  volume  and  return  are: 


Qt''=Y,^at  (59) 

a 

E^°*  (60) 


as  each  trader  creates  a  volumes  Vat  and  moves  prices  by  an  amount  rat-  They  correspond  to  the 
observed  return  and  volume,  simply  called  rt  and  Vt  in  the  paper. 

We  need  to  show  that  Cr-^ss  =  3  and  Cq-^ss  =  3/2  in  that  context  also.  Showing  this  is  a  bit 
involved.  The  difficult  part  is  to  show  that  Vat  and  Vat  have  respectively  power  law  3  and  1.5.  The 
general  properties  of  power  laws  make  r^  ®^  and  Q^  ^®  inherit  the  power  laws  of  rat  and  Vat,  i-e. 

Cr^Sg    =   2^Qagg    =    3. 

We  now  proceed  to  show  rat  and  Vat  have  respectively  power  law  3  and  1.5.  The  result  is  stronger 
than  the  one  in  the  body  of  this  paper: 

Proposition  11  We  index  funds  by  a,  and  dates  by  t.  We  call  9a  the  Hrading  style"  of  fund  a, 
and  Eat  indexes  market-wide  and  fund-specific  time-varying  characteristics.  We  assume  (i)  fund 's 
sizes  follow  Zipf's  law;  (ii)  the  price  impact  follows 

\rat\  —  '^atVat     > 

(Hi)  fund  a,  of  size  Sa,  trades  in  volumes 

Vat  =  katS  "; 
(iv)  fund  a 's  total  transaction  costs  are  equal  to  CaSa  ■   We  assume  the  following  stochastic  structure: 

Aot  =  ^{Sa,£at) 
kat  =  k{da,£at) 
6a  =  6{ea) 
Ca  =  C{ea) 

for  some  functions  X,  k,  6  and  C  with  compact  support  in  (0,  oo),  and  9a,  £at  o^'re  (possibly  multidi- 
mensional) random  variables,  independent  of  the  sizes  iSa)a==i  n>  identically  distributed  across  a. 
(9a)a=i  „  ond  {sat)fa=\  n  '^'"'^  independent,  but  the  Eat,  assumed  to  be  stationary,  can  be  correlated 
across  t  's  and  a 's.     The  probability  that  fund  a  will  trade  at  t  has  the  form 

Fat  =  G  {Sa,  9a)  H  {9a,  £at) 
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so  that  it  is  dependent  on  shocks  Eat  o,n,d  the  size  Sa  but  in  a  form  that  is  weakly  separable.  Then,  the 
distribution  of  the  individual  (before  aggregation)  volumes  and  returns  follow  power  law  distributions 
with  exponents: 

Cq„,  =  3/2  and  Ct-^,  =  3. 

We  comment  on  the  hypotheses.  There  are  four  main  improvements  upon  the  text,  which  assumes 
constant  functions  X,k,5  and  C,  and  Fat  =  G{Sa)- 

Firstly,  5a  and  Ca  need  not  be  the  same  across  funds:  they  can  depend  on  the  "trading  style" 
6a  of  fund  a,  though  we  keep  the  assumption  that  they  are  independent  of  the  size  Sa-  Secondly, 
the  probability  of  trading.  Fat,  the  volume  traded  Vat,  and  the  price  impact  \rat\  can  vary  from 
trade  to  trade.  This  can  reflect  changing  "market  conditions,"  such  as  the  liquidity  1/A.  For 
instance,  the  probability  of  trading  Fat  could  increase  with  the  probability  of  trading.  Thirdly,  we 
allow  intertemporal  correlation  in  the  noise  structure  via  Eat-  This  allows  for  autocorrelation  in  the 
trading  frequency.  Fourthly,  the  probability  of  trading  Fat,  the  volume  traded  Vat  =  S^''k(Oa,eat), 
and  the  price  impact,  \rat\  =  A  [9a, Eat)  Vat  '^^^  t)e  correlated.  For  instance,  the  volume  k  could  be 
higher  when  the  liquidity  1/A  is  higher.  The  fact  that  Eat  is  multidimensional  allows  a  large  class 
of  imperfect  correlations:  one  component  of  Sat  can  be  a  market-wide  term  dependent  on  t,  and 
another  can  be  an  idiosyncratic  term  dependent  on  a  and  t.  Also,  the  price  impact  can  vary  on  the 
"nature"  of  the  fund,  represented  by  9a  (for  instance  degree  of  impatience  represented  by  jj,  in  the 
paper  can  be  part  of  a  multidimensional  6a)- 

The  crucial  hypothesis  is  that  the  distributions  of  {5a,kat,Xat,Ca)  are  "scale  invariant,"  i.e.  do 
not  depend  on  Sa-  The  essence  of  the  proof  is  similar  to  the  one  in  the  paper,  though  the  notations 
are  more  involved.  The  proof  is  fairly  technical,  and  would  be  too  long  to  reproduce  here.  It  is 
available  from  the  authors  upon  request. 

10     Appendix  C:  The  model  with  general  trading  exponents 

We  present  here  the  general  structure  of  the  model.  It  allows  to  see  which  assumptions  are  crucial 
to  get  the  "cubic"  exponents.  Also,  natural  quantities,  the  "exponents  of  trading" ,  emerge.  Their 
direct  measurement  is  an  interesting  task  for  future  research.  We  call  C,s  and  Cl  the  Pareto  exponents 
of  the  large  traders  and  the  liquidity  providers,  respectively.  We  make  the  following  Ansatz  for  the 
behavior  of  our  variables  in  the  large  S,  |  Ap|,  and  T  limit: 

•  A  large  trader  of  size  S  will  make  large  position  of  size  V  =  S^  aX  an  annual  frequency  5~*. 

•  If  a  large  traders  makes  an  order  at  a  price  concession  Ap,  then  the  frequency  of  arrival  of 
liquidity  traders  is  /  =  Ap",  and  each  liquidity  provider  of  size  s  supplies  a  number  of  shares 
q  =  a-  Ap^. 

•  A  trader  of  size  S  will  adjust  the  trading  frequency  so  as  to  pay  a  proportional  amount  of 
transactions  costs  5*^. 

•  The  large  trader,  in  a  given  trade  of  size  V,  wants  to  minimize  a  loss  function  Ap+^T^ where  T 
is  the  mean  time  needed  to  find  enough  liquidity  providers  to  complete  the  trade.  For  instance 
r  <  1  would  correspond  to  a  concave  arrival  time  of  information  to  the  market. 

The  original  model  corresponds  to  a  value  of  0  for  a  and  n,  and  a  value  of  1  for  C,s,  Cl,  0,  and  r. 
The  next  Proposition  describes  the  results. 
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Proposition  12    Under  the  above  conditions,  which  generalize  the  model  in  the  text,  the  exponents 
of  returns,  volumes,  individual  trades,  and  number  of  trades  are: 

C.  =  ^(^+a  +  ^)  (61) 

Cg  =  ^  (62) 

C,  =  C.  +  (a-l)(^|^  (63) 

The  measured  price  impact  function  is  Ap  ~  V  with 

^  =  {a  +  P)r  +  l  (^') 

and  the  exponent  of  the  trading  frequency  is: 

0  =  5(l  +  7)-«:-l  (66) 

Hence  the  exponents  of  returns,  volume  and  number  of  trades  increase  with  the  exponent  of  the 
distribution  of  traders  Cs-  Somehow  forcing  large  traders  to  trade  less,  other  things  equal,  is  going 
to  decrease  the  fat-tailness  of  returns,  i.e.  increase  C,r-  Decreasing  the  time  pressure  to  trade,  in 
the  sense  of  lowering  r,  also  decreases  the  fat-tailness  of  volumes.  Equation  (65)  shows  how  to  get 
values  for  7  different  from  1/2,  the  value  we  mostly  used  in  the  paper. 

Proof.  The  probability  that  a  target  volume  ^  is  >  a;  is: 


Js^>x 


so  that  the  distribution  of  target  volumes  is  (62). 

After  a  time  t,  the  large  trader  has  received  on  average  ~  Ap°''^^t  shares.    So  the  mean  time 
needed  to  realize  a  trade  of  size  \^  is  T  =  h'V/Ap"'^^,  and  the  objective  function  is: 

min  Ap  +  fih'  - 


which  yields  Ap  ~  yi'with  7  given  by  (65).  This  gives  (61)  by  (56). 

Each  transaction  costs  VAp{V)  ~  F^+t  ~  5'5(t+i)^  go  the  annual  amount  will  be 

g-'P  .  Ci5{l+7)  _  5^1+'^ 

which  gives  (66). 

The  number  of  trades  is  n  ~  V/Ap^,  which  gives 


n  ~  1/ c^+V+i .  (67) 


and  (64)  by  (56). 
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An  individual  trade  >  x  involves  a  target  volume  V  >  x,  and  a  fraction  equal  to  P  [siAp^  >  x) 
of  the  n  (V)  ~  V/Ap^  liquidity  traders  who  meet  his  demand.  So: 


P (gi  >  x)  ~  /       V-i-^"  ■n{V)dV-   f 

Jv>x  Jsi 


ApiV)^>x 


I 

Jv>x 


V-'-'^y^(^)  dV 


=  X~^^    [         y-CQ-/37+a-/3'7 
Jv>x 

^-CQ-(a-i)(i-^7) 

so  that  C,  =  Cq  +  (Cl  -  1)  (1  -  Pi),  which  gives  (63).    ■ 

11     Appendix  D:  A  model  for  the  linear  supply  function  (38) 

We  present  a  possible  model  equation  (38),  where  a  liquidity  provider  of  size  Sj  gives  qi  for  a  price 
concession  Ap.  As  we  try  to  keep  the  theory  modular,  the  sub-model  is  essentially  independent  of 
the  rest  of  the  theory.  Oiu  model  is  a  simple  version  of  the  inventory  cost  models  of  e.g.  Grossman 
and  Miller  (1988)  and  Stoll  (1978).  For  simplicity,  we  drop  the  subscript  i  in  this  Appendix.  We 
use  a  very  stark  model  to  side-step  the  complexities  identified  by  O'Hara  (1995). 

The  liquidity  provider  has  a  net  assets  s,  and  a  mean  variance  objective  function  on  his  returns 
at  some  horizon'*^  H,  with  a  zero  riskless  rate. 

U==E  [Wh]  -  ^varWu 

The  factor  pjs  ensure  that  the  risk  taking  behavior  is  independent  of  wealth,  as  C/  is  homogenous 
of  degree  1  in  (Whi  s).  If  he  holds  A  shares,  with  expected  return  m  and  volatility  ct,  his  utility  is: 


U* 


W  +  H  {mA  -  |-^^^^)  (68) 


We  will  assume  that  he  is  allowed  to  supply  the  liquidity  once  before  he  can  seU  it  again,  costlessly, 
to  the  market.  On  average  it  takes  a  time  L  to  do  this.  We  assume  that  he  cannot  rebalance  his 
portfolio  in  the  meantime.  Formally,  we  could  model  L  as  the  average  time  before  some  other  large 
liquidity  provider  arrives,  who  would  absorb  liquidity  costlessly.  It  could  also  be  the  amount  of  time 
to  sell  the  block  at  essentially  no  cost. 

At  the  optimum,  the  liquidity  provide  holds  A*  share  of  the  asset.  By  optimization  of  (68) 

A-  =  ^. 

If  he  supphes  q  shares  to  the  trader,  he  will  hold  A*  —  q  shares  between  times  0  and  L  and  the 
optimum  quantity.  A*,  between  times  L  and  H .  He  pockets  the  price  concession  gAp.  So  his  utility 
is: 

U**  =  W  +  qAp  +  L\m  {A*  -q)-  -^a^H  (A*  -  qf]  +  {H  -  L)  ImA*  -  y'''^^*'^] 


^*  Formally,  this  could  be  a  Poisson  time  at  which  his  portfolio  is  evaluated. 
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We  assume  that  the  large  trader  has  full  bargaining  power,  so  that  the  liquidity  provider  must  be 
just  indifferent  between  supplying  the  shares  or  keeping  them.  This  happens  if  U**  =  £/*,  i.e. 

Ap  =  pa'^Lq/s  (69) 

So  we  get  the  supply  function  (38)  with 


pa^L 

This  means  that  the  liquidity,  a,  is  higher  when  assets  have  low  volatility  a,  short  resell  time  L,  and 
when  the  effective  risk  aversion  p  of  the  liquidity  providers  is  low.  Those  are  the  traditional  features 
of  inventory  models. 

The  argument  above  holds  much  more  generally.  Costs  of  a  suboptimal  holdings  are  second 
order,  hence  equal  to  Kq^  for  some  constant  A.  The  excess  cash  pocketed  by  the  liquidity  provider, 
gAp,  has  to  compensate  that,  so  that:  qAp  =  Ag^,  and  Ap  =  Ag.  The  form  Ap  =  \sq  comes  from 
the  scale-invariance  in  s. 

12     Appendix  E:  Algorithm  for  the  simulations 

In  the  period  Ai  considered,  there  will  be  J  such  "trading  rounds"  where  a  big  trader  creates  one  or 
more  trades.  Each  round  j  has  a  sign  Ej  =  ±1,  and  gives  rise  to  r^,  Vj,  rij,  VJ,  and  n  .  We  define: 

J  J 

We  have  E  [rij  \  Vj]  =  K      up  to  numerical  prefactors.  In  reality,  empirical  returns  the  r  above,  plus 
some  noise  due  e.g.  to  news  and  other  market  events  outside  the  model.  However,  those  extraneous 
events  do  not  affect  the  large  events  and  do  not  affect  our  graphs  in  major  ways. 
One  simulates  T  time  intervals  indexed  by  t  =  1, ...,  T  in  the  following  way. 

1.  One  draws  an  integer  value  Jt  drawn  at  random,  as  the  integer  part  of  lOe^' ,  with  gt  a  standard 
normal. 

2.  One  draws  Vt  =  e'^"'  and  rjt  =  e^"'  with  u,  v  i.i.d.    standard  normals.    For  j  =  \...Jt,  one 
draws  Qj  with  C,q  =  3/2,  and  Ej  =  ±1  with  probability  1/2,  and  sets: 

rj  =  EjVtVt^Q]''^  (71) 

Nj  =  i^mQ]'^  (72) 

iV;  =  ejN,  (73) 

Note  that  vt^ilt  are  constant  across  the  Jt  rounds  j  of  the  time  interval. 

3.  Applying  (70),  one  gets  a  five  values  (r,  Q,  N,  Q',  N')  which  are  stored  as  (r^,  Qt-,Nt,Q[,  N[) . 
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One  calculated  the  various  jB  [y  |  X]  graphs  on  the  data  set  made  up  of  those  values  T  vectors 
{rt,Qt,Nt,Q't,NI.).  To  facilitate  comparisons  across  stocks,  one  rescales  the  variables  -  demean 
them,  and  divide  them  by  their  standard  deviation  if  this  standard  deviation  exists'^®,  otherwise  by 
their  absolute  mean. 

Equations  (71)  and  (72)  are  intended  to  be  the  analogues  to  the  theoretical  (44)  and  (46)  resp. 
r  and  N  vary  like  Q^^'^,  but  there  are  varying  "market  conditions"  vt  =  {as)~  (a  measure  of  the 
"depth"  of  each  liquidity  provider  of  the  market  at  the  given  point  in  time),  and  rjt  =  /j}^'^  (the 
arrival  rate  of  liquidity  providers) .  While  those  quantities  must  have  some  randomness,  we  do  not 
have  strong  prior  on  the  extent  of  their  randomness,  and  any  way  they  play  only  a  minor  role  in 
the  model.  So,  largely  arbitrarily,  we  chose  make  them  equal  to  c^",  with  u  a  standard  normal. 
The  .5  factor  captures  a  relatively  small  randomness,  but  the  results  are  largely  insensitive  to  that 
choice.  Finally,  the  value  of  J  around  10  was  so  that  the  average  number  of  trades  in  a  given  half 
hour  match  roughly  the  empirical  one,  without  fmrther  effort  to  calibrate  the  model.. 

13     Appendix  F:  Confidence  intervals  and  tests  when  a  vari- 
able has  infinite  variance 

13.1  Construction  of  the  confidence  intervals  for  Figure  7 

In  a  given  bin  conditioned  hy  Q  =  Qi,  with  k  elements  rl,...,r^,  the  point  estimate  of  £■  [r'^  \  Q  =  Qi] 
is  naturally  the  sample  mean  of  the  r|,  which  we  call  m.  Getting  a  confidence  interval  for  m  is 
delicate,  as  r^  has  infinite  variance,  so  the  standard  approach  relying  on  asymptotic  normality  is 
invalid.  But  the  theory  of  self-normalizing  sums  of  Logan  et  al.  (1973)  shows  that  if  fi  is  the  true 
mean  and  (Jr^  is  empirical  standard  deviation  of  the  rj  in  the  bin,  then  Student's  t  =  k^^^  {m  —  fi)  ja 
tends  to  a  non-Gaussian  distribution  that  depends  only  on  the  tail  exponent  of  r^,  which  is  2^^  =  3 
in  our  case,  and  the  fact  that  r^  is  a  positive  variable.  So  a  95%  confidence  interval  is  ttij  ±  Ar? 
with  Ar?  =  x^r?  kr^^"^  for  some  real  number  x-  Monte-Carlo  simulations  show  x  =  3.23.  Thus 
we  can  use  the  standard  procedure  for  confidence  intervals,  with  a  factor  x  —  3-23  rather  than  the 
Gaussian  x  =  1-95. 

13.2  Test  of  the  Hnear  relation  E[r^  \  Q]  =  a  +  /3V 

For  each  bin  Qi  of  Q,  we  set  rf  =  £  [r^  |  <5  =  Qi] ,  and  Arf  equal  to  the  half- width  of  its  95% 
confidence  interval  [rf  —  Ar?,r?  -I-  Ar?].  By  weighted  least  squares,  with  weights  (Ar|)  ,  we  fit 
an  affine  relationship  E  [r'^  \  Q]  =  g  (Q),  with 

ff  (Q)  =  0.90 -I- o.3iy 
(0.057)     (0.013) 
R'^  =  0.98 

The  standard  errors  are  in  parentheses.  We  find  that  for  all  values  Qi  >  3,  the  predicted  value  g  (Qi) 
belongs  to  the  95%  confidence  interval: 

giQi)e[rf-Arf,r^  +  Arf]. 


'So  we  took  the  first  moment  for  V,r^  and  V/N,  and  the  second  moment  for  N,N',r  and  N'/N. 
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We  conclude  that,  at  the  95%  confidence  level,  we  cannot  reject  the  linear  form  B  [r-^  |  Q]  =  g  [Q] 
for  Q  large  enough,  here  Q  >  3. 
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